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Abstract 



It has been shown in the past two decades that quantum devices can achieve useful in- 
formation processing tasks impossible in classical devices. Meanwhile, the development of 
quantum error correcting codes and fault-tolerant computation methods has also set down a 
solid theoretical promise to overcome decoherence. However, to-date, we have not been able 
to apply such quantum information processing tasks in real life, because the theoretically 
modest requirements in these schemes still present daunting experimental challenges. 

This Dissertation aims at reducing the resources required for various schemes related 
to simple and robust quantum computation, focusing on quantum error correcting codes 
and solution NMR quantum computation. A variety of techniques have been developed, 
including high rate quantum codes for specific noise processes, relaxed criteria for quantum 
error correction, systematic construction of fault-tolerant gates, techniques in quantum 
process tomography, techniques in bulk mixed state computation and efficient schemes 
to selectively implement coupled logic gates using naturally occurring Hamiltonians. A 
detailed experimental study of quantum error correcting code in NMR is also presented. 
The goal is to get ready tools and techniques that may apply to some useful candidate 
systems for implementing quantum computation in the near future. 
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Chapter 1 



Motivation 



1.1 Time frame 



The two important but distant subjects of quantum mechanics and the theory of computa- 
tion were first put together by Benioff, Feynman and Deutsch back in the mid 1980's |14, 



51, hS. In the following decade, many interesting results and intriguing possibilities were 



found, including various cryptographic and communication protocols |15, |T^, and com- 
putation problems which exhibit a gap between the computation power of classical and 



quantum models [46, |106(| . These were followed by an extended three-year- long excite- 
ment brought by the discovery of the fast factoring and search algorithms [|103| , 61 ] , refined 
notions of primitives of computation f 



12, 47, and the development of quantum 



error correcting codes |105, 108 1 which can be used to achieve reliable quantum computa- 



tion m, g, H, |7|, m, |9|, |104 |. 



The work described in this Dissertation started in April, 1996, in the peak of the "the- 
oretical excitement". Quantum information is a very interesting subject, not only because 
one can attempt to engineer faster computation devices, but one is facing challenges which 
will change our fundamental understanding of many subjects, including quantum mechanics, 
the theory of computation, cryptography, and information theory. 

Meanwhile, our theoretical development was not well matched by experimental progress. 
While theorists proved that quantum computation can be made arbitrarily reliable if the 
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elementary error probability is below some optimistic threshold value of about 10~^, and 
if coding are performed, taking about hundreds of quantum bits to encode one, experi- 
mentalists heroically prepared a couple of quantum bits and demonstrated non-trivial logic 
operations. It is believed that most of the obstacles are technical, and will be resolved with 
time. 



1.2 Overview of the Dissertation 

Most of the work in this Dissertation attempts to narrow the gap between theory and 
experiment. The focus is mostly theoretical. Various approaches are taken, which revolve 
around more efficient methods to make quantum computation reliable. One approach is to 
view quantum error correction as a trade-off between resources and reliability, and the goal is 
to minimize the resources needed to achieve a certain level of reliability. Another approach is 
to consider the trade-off between resources and the computation power. This is motivated by 
the surprising realization of solution NMR quantum computation towards the end of 1996. 
In this implementation, difficult experimental requirements are waived by various techniques 
which compromise experimental feasibility and the computation capability. Focusing on the 
subjects of quantum error correction and solution NMR quantum computation, techniques 
were developed in this Dissertation which may apply to a wide range of candidate systems. 
Specifically, the following results were obtained. 

1. Quantum error correcting codes for specific noise processes which achieve better rates 



than any codes correcting for general errors |31 



2. Relaxed sufficient criteria for quantum error correction which admit more efficient 



quantum codes [[78 1. 



3. A systematic construction of fault-tolerant logic operations which unifies known con- 



structions and enlarges the set of primitives ||115|] . 



4. An alternative recipe to perform quantum process tomography, the complete charac- 
terization of quantum processes useful for designing better codes and gates (unpub- 
lished) . 
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5. An efficient sclieme to perform selective logic operations using some non-selective 
evolution as a primitive 

6. An experimental realization of a quantum code in NMR which was a first demon- 
stration of noise reduction by coding |79]. A thorough characterization of systematic 



errors on the NMR system was also made. 

Contributions were also made in the development of various techniques used in NMR quan- 
tum computation, including a particular hybrid scheme for state labeling (unpublished), a 
modification of the quantum process tomography procedure to the NMR system and a 



study of the computation power of bulk systems with imperfect initial states |114|] . 

Results on quantum error correction for specific noise processes were obtained in col- 
laboration with I. Chuang, M. Nielsen, and Y. Yamamoto. Results on fault-tolerant gate 
constructions and the computation power of bulk systems were obtained in collaboration 
with X. Zhou and I. Chuang. The efficient scheme to perform selective logic operations 
was co-developed with I. Chuang, F. Yamaguchi, and Y. Yamamoto. Similar result was 
independently reported in The experimental study of quantum error correction was a 



joint effort with L. Vandersypen, I. Chuang, X. Zhou, M. Sherwood, C. Yannoni, and M. 



Kubinec. Related result was independently reported in |41|. The ongoing study of quantum 
process tomography in NMR is done in collaboration with A. Childs and I. Chuang. 

This Dissertation is structured into three parts. 

• Part I contains the current overview chapter, and Chapter |2|, which reviews useful 
background for this Dissertation. Quantum circuits, universal sets of quantum logic 
gates, the formalism of quantum operations, quantum process tomography, and some 
common noise processing are covered. Item 4 is included as part of the discussion in 
quantum process tomography. 

• Part ^ focuses on quantum error correction and consists of three chapters. Chap- 
ter ^ reviews the major results in the development of quantum error correction. It 
covers classical error correction, the general theory of quantum error correction, the 
stabilizer formalism, fault-tolerant quantum computation and the threshold theorem. 
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Chapter Q describes quantum codes for specific noise processes and the relaxed crite- 
ria for quantum error correction. Chapter ^ describes the systematic construction of 
fault-tolerant gates. 



Part III focuses on NMR quantum computation, and also consists of three chapters. 



Chapter ^ describes the elements in NMR quantum computation. The problem of 
initial state preparation, process tomography, and bulk computation will be described, 
and some partial solutions will be given. Chapter ^ describes the efficient scheme 
to select specific coupling in NMR and related systems. Chapter ^ describes the 
experiment on quantum error correction. 

Review materials in Chapters |2| and ^ are primarily based on [^, 28, 83, 20, 95, 



89, 56, 96, 104 1 . Section 4.2.5| in Chapter ^ is based on |56|. The primitives described in 



Section are due to Bennett and Gottesman. Section |6.3.1| is based on |Q. Many results 
presented in Chapter ^ are due to Chuang. 

The problem in building quantum computation devices is not just decoherence, but 
rather the many simultaneous requirements of quantum information processing which con- 
tradict each other. It is important to (1) prepare the system in a fiducial initial state, to (2) 
perform a universal set of logic gates, to (3) implement measurement which will correctly 
read out the computational results, and to (4) maintain long coherence times. Throughout 
the past few years, more and more candidate implementations have been proposed, each 
presents some novel solution to part of the listed requirements and faces more difficulties in 
other requirements. It is the aim of this Dissertation to make ready some of the theoretical 
techniques which can be adapted to a wide range of candidate systems. 



Chapter 2 



Fundamental concepts 

2.1 The Basics 

The fundamental unit of classical information is a bit, a random variable that takes value 
on {0,1}. It can also be viewed as a 1-dimensional vector over Z2 (the integers modulo 
2). An n-bit classical string takes value on a set of 2" possible states. It is also an n- 
dimensional vector over Z2. In contrast, the fundamental unit of quantum information, a 
quantum bit or a qubit is a 2-dimensional unit vector over the complex field C. A state 
of n qubits is a vector over a 2"-dimensional Hilbert space which is a tensor product of n 
2-dimensional Hilbert spaces. Following the Dirac notation, a vector or a "ket" is denoted 
by |-). The dual of a vector or a "bra" is denoted by (-1. {|0), |1)} usually stands for the 
basis of a qubit. This basis is often called the computational basis. The state of an arbitrary 
qubit is given by a vector = a|0) + 6|1) with norm square (iplip) = 1 to represent some 
"total probability". An n-qubit state is likewise a vector over basis states with n-bit labels, 
following the tensor product structure of the n-qubit Hilbert space. The conjugate basis is 
defined as |±) = ^(|0) ± |1)). We may use 2-dimensional subspaces in a larger n-qubit 
space to represent qubits. To distinguish these subspaces from the constituent qubits of the 
larger space, we call the embedded ones "logical qubits" and the constituent ones "physical 
qubits" . 

The vector lip) is a pure state. As the overall phase is irrelevant, a pure state is more 
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precisely represented by a projector (■!/)!. In contrast, a mixed state is a distribution or an 
ensemble of pure states. For example, the state is {ipk) with probability pk- Mathematically, 
a mixed state is a convex combination of projectors. For example, the above distribution is 
represented hj p = ^kPk\'^k){'^k\ which is called a density matrix. This definition follows 
naturally if the expectation of an operator O is generalized from {^\0\i{j) to Tr(Op). It is 
immediate that any density matrix is positive with unit trace. 

In a usual quantum information processing task, one prepares some initial state, say \^) 
or p, and applies a sequence of unitary operations and measurements. A unitary operation 
U evolves to U\ip) and p to UpW. Unitary operations are quantum analogs of "gates". 
Quantum mechanics admits projective measurements of hermitian operators. Each measure- 
ment outcome is some eigenvalue of the measured operators. The measurement projects the 
original state onto the eigenspace corresponding to the measurement outcome. — > P\^) 
or p ^ PpP where the projector P depends on the outcome). Measuring cr^ (to be defined 
in the next section) projects {ip) = a\0) + onto |0) or We postpone the discussion 



on non-unitary evolution and generalized measurements until Section 2.3. 



2.2 Quantum circuits 

The quantum circuit is both a model and a representation of quantum information process- 
ing. We will discuss the usual conventions and introduce common circuit elements in this 
section. Consider the following example of a quantum circuit. 



|0) 



z 



-i H 



Figure 2.1: An example of a quantum circuit 



The following conventions are used throughout the Dissertation: 

• The horizontal and vertical directions schematically represent changing time and 
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space. 

• Time runs from left to right. 

• Horizontal single-lines represent quantum registers. 

• Input states are at the far left, and output states are at the far right. 

• Boxes enclosing capital letters represent gates. Some gates are given by more special 
symbols, such as the symbol connecting two registers with a ® on one and a • on the 
other. 

• The meter represents a projective measurement along the computational basis. 

• The double line represents classical information. An operation connected to a mea- 
surement box by a double line is performed conditioned on the measurement outcome 
being 

Wc now define sonic important quantum gates: 

The Pauli matrices are defined in the computational basis as: 



(2.1) 








1 




-i 




1 








ay = 










1 





i 




-1 



where the basis is ordered as (|0), We often use the shorthands 
X = aj;, Z = a:,, and Y = XZ 



-ia„ 



-1 

1 



(2.2) 



"/" stands for the identity matrix (2x2 unless otherwise stated). We may use o"o,i.2,3 to 
stand for I,ax,y,z- For a system of n qubits, a^/^ denotes a ai acting on the j'-th qubit (and 
an / acting on every other qubit). The Pauli group over n qubits, Qn, is generated by o"!"'^ 
for z = 1, 2, 3 and j = 1, - ■ ■ ,n. We may refer to the smaller real subgroup generated by X 
and Z also as the Pauli group, in which case the usage will be made explicit. 

For a fixed unitary operator U, the mapping O — ^ UOU^ is called the conjugation by 
U. We say that U takes O to UOU^. 
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The Clifford group consists of operations which take Gn to Gn- Since conjugation is 
reversible, the Clifford group is a subgroup of the permutation group of Gn- The Clifford 
group can be generated by P, H, and cnot (see Section 5.8 in ||5^), defined as follows. 
• The phase gate P is a 7r/2 phase rotation which equals 



P 



1 

i 



(2.3) 



The Hadamard gate H is a ir rotation about the axis '~^{^ + 



H 



1 

71 



1 1 
1 -1 



(2.4) 



• The controlled-NOT (cnot) acts on two qubits, called the control and the target bits. 
When the control bit is |1), cnot applies an X to the target. If the first and second qubits 
represent the control and the target bits respectively, the cnot has matrix representation: 



10 
10 
1 
10 



(2.5) 



We may write CNOT^b where a and h are labels for the control and the target bits. The 
Clifford group over Gn is generated by P^'^\ H^'^'> and CNOTjj for i, j G {1, • • • , n}. 

There are many other important gates. The rotation operator of angle 9 about the axis 
fj is defined as 



Rn{9) 



-iefi-a/2 



COS — / — i sin — fj ■ a . 



(2.6) 



2 2 

Up to an overall phase factor, Eq. ( p. 61) represents the most general one-qubit unitary 
operation. Other gates of interest include the Toffoli gate (controlled-controlled-NOT), the 
Predkin gate (controlled-SWAp), and the vr/S gate = Diag(l, \/i). 



The circuit symbols for some of these important gates are shown in Fig. 2.2. 
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X 



Y 



z 



p 



m 

H 



m 

u 



Control 
Target 



U 



CNOT 



Controlled-C/ 



Toffoli gate 



Figure 2.2: Symbols for quantum gates (courtesy of |56]) 



We conclude this section with a discussion on the universality of quantum gates. It 
is known that classical circuits can be built with just Toffoli gates, or gates from the set 
{not, or, and}. These are examples of universal sets of gates for classical computation. 
Therefore, an arbitrary circuit can be built from certain elementary components. Similarly, 
a set of gates is universal for quantum computation if it generates a dense set in the set of all 
unitary operations. In other words, an arbitrary unitary operation can be approximated to 
arbitrary accuracy by compositing gates from the set. An important universal set of gates for 
quantum computation is the set of all single qubit operations and cnot. Another important 
universal set of gates is {cnot, P, H, Toffoli}. Note that the first set is continuous, while 
the second set is discrete. 

The proofs of universality are out of scope of this Dissertation. Interested readers can 

refer to the original papers [^, 45, 47, 31, |l^ or an informative review in | p8[ |. We 

outline the main ideas. An arbitrary dx d unitary matrix is a product of at most d{d— l)/2 

matrices each acting on two basis states. Each two-level gate on n qubits {d = 2") is 

a product of 0{n) operations each having (n — 1) control bits and one target bit. Such 

operations can in turns be expressed as 0{n) cnots and single qubit operations, proving 

the universality of the first set. Note that the continuous set generates all unitary operations 
^ This is reminiscent of decomposition of a permutation into swaps or transpositions. 
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exactly. In contrast, the discrete set generates a dense subset of the unitary operations. In 
particular, the discrete set can approximate any single qubit gates (and therefore any gates). 
This is based on two additional facts. First of all, each single qubit gate can be expressed 
as at most three rotations about two non-parallel axes. Second, iterating a fixed irrational 
angle rotation can approximate an arbitrary angle rotation about the same axis. This is the 
crucial step in which a continuum of gates is generated by a discrete set. The irrational angle 
rotation can be implemented using H, P and the Toffoli gate, completing the proof. This 
particular construction can approximate an arbitrary angle of rotation to accuracy e using 
~ 0{\/e) elementary gates. Consider a circuit of N gates taken from the first universal 
set. To approximate the circuit to an overall accuracy e, each gate has to be accurate to 
e/N |]l9|. Therefore, 0{N'^/e) gates from the discrete set are required. A more efficient 



construction by Solovay (unpublished) and Kitaev ||69[ requires only 0(A^log^(A^/e)) steps 
where c « 2 — 4. 

Finally, we remark that even though composition from the continuous universal set is 
exact and composition from the discrete set is approximate, gates are inevitably inaccurate 
in real life, making such distinction irrelevant. Even more counter-intuitively, the discrete 
set can be more useful for achieving accuracy when the elementary gates are imperfect, as 
we will see in Chapter |^. 

2.3 Quantum operations 

In this section, we extend our discussion on closed quantum systems and unitary evolution 
to open quantum systems and their evolution described by the formalism of quantum oper- 
ations. We will see that an open quantum system can be modeled as part of a larger, closed 
quantum system undergoing unitary evolution. Surprisingly this simple model is extremely 
general - it can describe any physically reasonable dynamics to be defined. Such formalism 
will be used to describe noise processes and non-unitary computation procedures which are 
the main subjects of this Dissertation. 



In Section 2.3.1, we will derive three equivalent approaches to describe open quan- 



tum systems: an axiomatic approach, a subsystem evolution or system-reservoir coupling 
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approach and an operator sum representation approach which has a stochastic process 
interpretation. These equivalent approaches describe a class of possible dynamics called 



quantum operations. In Section 2.3.2 , we will describe two methods to perform quantum 
process tomography, the procedure to completely characterize a quantum process. Some 
common noise processes are described in Section using the formalism developed. The 
traditional description of open quantum systems in terms of master equations is given in 



Appendix A.l 



Section 2.3.1 is based on |85, |2^. The interpretation of a crucial result in [28| is original 



Section |2.3.2| is based on and original (unpublished) work. 



2.3.1 Equivalent approaches to quantum operations 



The common scenario depicted in Fig. 2.3 is a physically motivated example of quantum 
operations: 



p 












pe 







Accessible 
"system" 



Inaccessible 
"environment" 



Figure 2.3: Circuit for system-environment coupling 



We assume that the system and environment are initially in a product state p pe • Due to 
some system-environment interaction, the density matrix of the combined system is evolved 
unitarily to pc = U{p pe)U^ ■ Suppose p' is the density matrix for the system when 
the environment is inaccessible {p' is called the reduced density matrix) . How does pc 
determine p'? For consistency, p' and pc should predict the same expectation value of any 
system observable O. Therefore, TiCs{Op') = Tr^e (O ^ I)pc ■ This is true only if 



P 



Tie \ UpcU^ 1 = TVe f [/(/) Pe)U^ 



(2.7) 
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where Tie denotes the partial trace ^ on the environment. The partial trace is equivalent 
to measuring the environment in an arbitrary basis and forgetting the result, so that the 
system is in a statistical mixture of states corresponding to the inaccessible measurement 
outcomes, p' in Eq. ( p.7| ) defines a mapping £ on the system density matrices, which is 
written as p' = £{p). 

The system-environment coupling scenario can be generalized to the first approach of 



quantum operations, considered as state changes in a subsystem (see Fig. 2.4). 



p 




p' 








pe 







Accessible 
"system" 



Inaccessible 
"environment" 



Figure 2.4: Circuit for system-environment coupling 

The system interacts with some ancillary system and some subspace labeled by "o" is 
discarded. We allow state changes which are conditioned on some measurement results in 
the discarded subsystem. Mathematically, the process is described as 



£{p)=TVo\U{p® Pa)U\l®Po) 



(2. 



where Pq is a projector acting on the discarded subsystem. The probability of having the 
final state is given by Tr((?(p)) < 1 which can be less than unity. The output density 
matrix is given by p' = 'j^^^^- It is important to absorb the normalization into p' to 
keep £ linear. Having described the approach of quantum operations based on subsystem 
evolution, we now show that such formalism is equivalent to two others. A summary of 



these three approaches and their relations is shown in Fig. 2.5 



^ Consider two systems A and B. The partial trace over system B is a linear operation on the composite 
system AB defined by Tr_B [ |ai)(a2| 8 \bi){b2\ ] = \a\){a2\ Tr(|6i)(62|) where \ai) and \bi) for i = 1,2 are 
vectors in A and B. 
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Subsystem 


evolution 


£ip) = Tvo 




Pa)U^ ■ {I Po) 






Operator sum representation 
£{p)=Y.kAkpAl 



Axiomatic characterization 

• £ linear 

• £ trace non-increasing 

• 8 completely positive 



Figure 2.5: Three equivalent approaches to quantum operations 



To elaborate on the highly non-trivial result depicted in Fig. |2.5| , we need to define 
some notations. Let 7ii and 7i2 be the input and output Hilbert spaces for the quantum 



operation, and B{Hi) be the set of bounded operators acting on Hi. Figure 2^ asserts that 
the three classes of mappings from B{TCi) to B{TC2) with the following characterizations are 
the same: 



1. Subsystem evolutions which are given by: 

£{p)=Tlo\U{p(S)Pa)U\l2(S)Po) 



(2. 



where p £ BiTii) is the input density matrix, pa G B{7ia) is the initial ancilla density 
matrix, I2 is the identity operator in B{TC2), and Po is a projector acting on the 
discarded system (labeled by "o") which is the complement of Ti.2 in Tii Tia- 



2. State changes described by some operator sum representation [101, 28, 75] 



£{p) = Y,AkpAl 

k 

where ^ AlAk < I 



(2.10) 
(2.11) 
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An input state becomes a mixture of output states each resulting from the action 
of an Afg. These are hnear operations from TCi to 7^2 which are analogous to a 
set of probable "events". The Af^ are sometimes called "Kraus operators". In this 
Dissertation, Af. are referred to as the operation elements of £. When the input state 
is p, the probability of the "event" A^ is given by Tr{Al.Akp). Equation (2.11) restricts 
the total probability to be no greater than unity. 

3. The set of mappings from B{7ii) to B(7i2) which satisfy three axioms: linearity, 
complete positivity and trace non-increasing. Complete positivity is defined by the 
following: 

Definition 1 : An operator A is positive if all eigenvalues are non-negative. 

Definition 2: A linear map M acting on operators is positive if A > ^ 
M{A) > 0. 

Definition 3: A linear map M on 7ii is completely positive if, for any 
ancillary Hilbert space 7ia and for all operators A on 7ii Tia, A > Q ^ 
{M. ®Xa){A) > 0, where Xa is the identity operation on BiTLa)- 

The physical significance of complete positivity is the following. A quantum operation 
takes a density matrix to another and therefore has to be a positive map. Moreover, 
if an input state is initially entangled with some ancillary system, and £■ acts on the 
system while the identity operation acts on the ancillary system, the density matrix 
representing the final combined state has to be positive. 

There are important implications of the equivalence. Any physically reasonable process 
(such as defined by the three basic axioms) can always be described by some operator 
sum representation or be modeled by some coupling with an external system followed by 
discarding part of the combined system. The fact that continuous distortion of a quantum 
state has an operator sum representation is important in quantum error correction, as we 
will see in subsequent chapters. 
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After examining the physical contents of the three approaches, we now proceed to prove 
their equivalence. The proofs are ordered as given in Fig. p.5|. 



Proof of (1): Suppose 



£{p) = Tlo \ U{p Pa)U\h ® Po 



(2.12) 



We want to find an operator sum representation for 8. We write pa = X); M\'i'i){H\ where 
and A/ are the eigenstates and the corresponding (non-negative) eigenvalues of pa- Let 
be the eigenvectors of Pq, ordered such that = Z^fcLi |jfc)(jfc|- Performing the partial 

trace on the basis Eq. ( p. 12 ) can be rewritten as: 



K 



Sip) 



Y.J2^^(jk\u[p0\ii){ii\\u^\jk) 

k=l I 
K 

^^^^AikpA]^ where Aik = {jk\U\ii) 



(2.13) 
(2.14) 



k=l I 



Note that Aik = {jk\U\ii) is indeed an operator mapping B{Ti.i) to B{H2)- It remains to 
show Y.k=i T,i Ak'^ik < I- For any state 



K 



k=l I 



k=l I 



Tr(f(|V)(V|)) < 1 (2.15) 



where the last inequality is immediate from Eq. ( [2.12 ). This completes the proof that any 
subsystem evolution can be described by an operator sum representation. 

Proof of (2): Let £{p) = Y.k=iMpA be given, with ^f^^^^Afc < /. Let = 



/ — X^fcLi ^I^fc- Let TLa be some + 1 dimensional space with basis {|A;)}. Consider an 
operator U which acts on Tii ® Tia according to 



N 



k=0 



(2.16) 
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At this point, U may not be unitary. However, from Eq. (2.16) 



Y: [((V'i|4)(^i] [(^fc'i^2))i^') 

;fc'=0 
N 

^ {HAiA,,\^l^2){k\k') 



kk'=0 

N 
k=0 

(Vi|V2>(o|o) 



(2.17) 

(2.18) 

(2.19) 
(2.20) 



Therefore, U preserves the inner product on the subspace spanned by |?/')|0) and can be 
extended to a unitary operation on the whole space. 

Let P = J2Li\k){k\- Then, 



iA,,\mk')mAL){k'' 



Tra[ui\^){^\^\0){0\)UHl<g)P) 

N 

E i 

k,k',k"=l 
N 

k=l 



(2.21) 
(2.22) 

(2.23) 



Since Eqs. ( p.21 )-( p.23| ) hold for all pure states in TLi, they also hold for mixed states. 
Therefore, Eq. ( 2.21| ) is indeed a correct system-environment coupling model having the 
same dynamics as the given operator sum representation. 

Proof of (3): £{p) = J^k^kpAl. with J2k^k^k < I is obviously linear and trace non- 
increasing. Let p be a positive operator in B(7ii (gi 7ia) where 7ia is any finite dimensional 
Hilbert space. By hypothesis, {£ ®Ta){p) = Z^feC^fc ® ^a) P (^/c ® la)'^ ■ Since p is a convex 
sum of outer-products, {£ ®Za){p) is also a positive sum of outer-products and is positive. 
Hence, £ is completely positive. 

Proof of (4): This is the most non-trivial part of the result. This ingenious result was 



proved by Choi in |28]. The present proof is essentially the same proof, but re-casted in a 
completely different language. A related but slightly different proof can be found in Chapter 
3 of Is! 
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Suppose £ is a completely positive linear map from BiTLi) to B{Ti.2)- Our goal is to 



show that £ can be expressed as in Eq. (2.10). For concreteness, let be the dimension of 
Hi. 

Consider the mapping Z ® £ from BiTLi ® Tii) to B{'Hi ® 7i2)- In particular, consider 
the action of T on the following operator in B{Ti.i 'Hi): 



(2.24) 



where {|^)}j=i,...,ni is a basis for Hi- Note that Y = ni|$)($| where |$) is the maximally 
entangled state |<I>) = X]j N) ® N)- ^ is therefore positive. Y can be explicitly written 
as: 



Y 



1 













1 
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1 



(2.25) 



which is an ni x ni array of ni x ni matrices. The (i,i) block is exactly Hence, 
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• ■ ■ 



1 



which is an ni x ni array of n2 x n2 matrices. The block is exactly 



We now express (X ® £)iy) in a manner completely independent of Eq. ( 2.26| ). Since 
Y is positive and £ is completely positive, {Z ® £){Y) is positive, and can be expressed as 
iZ £){Y) = Z]r=i^ |ofc)(Q^fc|; the lofc) are the eigenvectors of {Z ® £){Y), normalized to 
\/Afc, where Afc are the positive eigenvalues of {Z (gi £){¥)■ We can write {Z ® £){Y) as: 
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(j®£:)(y) = Efc 



(il4 



(nil4 



^tl2) 



= Ak\ni) 



where the column represents |afc) and the row represents (a^.|. We divide the column into 
ni segments each of length n2, and define a matrix A^. with the i-th column being the i-th 
segment, so that the i-th segment is exactly Ak\i). Therefore, 



A.11>(1|4 


A,\l){2\Ai 




Ak\l){ni\Ai 


Afe|2>(l|4 


Ak\2){2\Ai 




Ak\2){ni\Al 










Ak\ni){l\Al 


Ak\m){2\Al 




Ak\ni){ni\Ai 



(2.27) 



Comparing Eqs. ( p. 261) and ( 2.27 ), £{p) = J2k^kpA\ Vp. Using this expression for £ and 
the fact that £ is trace non- increasing, J2k^k^k < I- This completes the proof. 
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We will primarily be using the operator sum representation approach for quantum oper- 
ations. We conclude this section with a discussion on the freedom of the operation elements. 
The operation elements are not unique for a quantum operation £. The classic example is 
thephase flip channel on a quhit, given hy£{p) = {p+ZpZ)/2= |0)(0|p|0)(0| + |l)(l|p|l)(l|, 
which turns all off-diagonal elements in p to 0. It means that randomly applying / or Z is 
the same as measuring the qubit without knowing the result! These two processes or in- 
terpretations are indistinguishable if we have no access to the agent coupled to the system. 
This example can be generalized: 

Theorem 1: Consider two sets of operation elements {Ak\ and {-B;}, where the 
Ak are independent. Then, the two sets generate the same quantum operation 
iff Bi = J2k uikAk where uik are entries of an isometric matrix. This is called 
the "freedom in the operation elements" for a quantum operation. 

Proof: Following the proof of (4), the theorem is the same as saying that 
Efc \ak){ak\ = J2i \bi){bi\ iff \bi) = J2k'^ik\ak), where |ofc) and A^ are related as 
in the proof of (4) and similarly for | bi ) and Bi . 

Sufficiency: If \bi) = Y.kUik\ak), then, Y.i\h){bi\ = Y.ikk' '^ikU*ik>\ak){ak'\ = 
T,kk'iT,i uikul,i)\ak){ak'\ = Efc \ak){ak\- 

Necessity: If J2k\'^k){o-k\ = J2i\bi){bi\, We consider the spans of {Ifl/t)} and 
{|6;)}. For any \a) which is orthogonal to all \ak), J2i{^\bi){bi\a) = by hypoth- 
esis, and I a) is orthogonal to all \bi). Therefore, the span of {\bi)} is in the span 
of {|afc)}, and \bi) = J2k'^ik\0'k) for some constants cik- Re-applying this to the 
hypothesis, Z^ifcfe' Qfccffc'|afc)(Q^fc'| = J2k\'^k){0'k\- Using the linear independence 
of |afc) (and thus that of |afc)(afc'|), J2i '^ikc'^j^i = Skk' and the are entries of an 
isometric matrix. This completes the proof. 

Note that we can similar argue that the span of is in that of Therefore a set of 

independent operation elements has minimal cardinality. Moreover, such independent sets 

are "canonical": every other set generating the same operation can be expressed in terms 

of the canonical ones by Theorem 

^ A matrix u is isometric if u^w = / (it^it = I <^ uu^ = /). Square isometric matrices are unitary. 
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The above proof also applies to the freedom in decomposing a positive matrix into a 
sum of outer products. The implication is that, different ensembles of states resulting in 
the same density matrix are indistinguishable, without access to the information outside 
the system. 

This degree of freedom in the operation elements is also manifest in the subsystem 
evolution approach. This is exactly the freedom in unitarily evolving the discarded system, 
which cannot affect the quantum operation as observed in the system only. The proof is 



straightforward, and we only state the result using Fig. ^ 

— £ip) 



p ■ 
|o). 



U 



p ■ 
|o). 







u 


- u - 



Figure 2.6: The degree of freedom in the operation elements, from the system-environment 
coupling approach. In the left, we construct a system-environment coupling model for {Ak} 
as described in the proof of (2), and retain all the definitions made before. In the right is 
the construction based on Bi = ^^uikA^. In other words, the dashed box is the evolution 
required if we apply the construction in the proof of (2) to the set {Bi}. 



2.3.2 Quantum process tomography 

Quantum process tomography is a procedure by which an unknown quantum operation of 
a system can be fully experimentally characterized. Complete determination of a quantum 
operation is important for characterizing our devices, such as quantum logic gates and 
quantum channels, as well as for understanding our noise sources which may lead to better 
correction techniques. 

Traditionally, such characterization revolves around semi-classical concepts such as cou- 
pling strengths, relaxation rates, and phase coherence times |8^, |5^. However, it is a simple 
exercise following the discussion in the previous section that there are 16" — 4" degrees of 
freedom in the quantum operations on an n-qubit system. Therefore, traditional techniques 
are insufficient for the purpose of quantum process tomography. 



Methods for performing quantum process tomography were first reported in |32, 94 1. 
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In this section, we summarize the method described in which is used in Part III. A 
completely different method will also be given, based on the proof of (4) in the previous 
section. This alternative method is original. 

The basic assumptions in quantum process tomography are as follows. The unknown 
quantum operation, £, is given as an "oracle" or a "blackbox" we call upon without knowing 
its internal mechanism. We prepare certain input states and measure the corresponding 
output density matrices to learn about £ systematically. The task to measure the density 
matrix of a quantum system is called quantum state tomography, and will be described in 



detail in Part |II|. We simply assume the ability to do so for now. We want to obtain an 
operator sum representation for £ to be used in our various applications. Therefore, the 
question is, how to convert experimental measurements on density matrices to knowledge 
of the operation elements? 



Method I @ 



Let the unknown quantum operation be £{p) = J2k ^kP^k- The crucial observation is to 
transform the information carried by the operation elements to the coefficients in the "x- 
representation" to be defined. For simplicity, the input and output Hilbert spaces are both 
equal to TC and have n dimensions. The procedure is as follows: 

1. Choose a fixed basis {Bm}m=i,---,n'^ for the operators acting on 7i. Express each 
as Ak = J2m CkmBm, and rewrite the operator sum representation as 

^(P) =Y1 CkmClnBmpBl = ^XmnBrnpBi (2.28) 
mnk ran 



where Xmn = J2k (^kmC%n- Equation (2.28) is called a ^-representation of £. Note that 



the information on £ lies with the coefficients Xmn rather than the fixed operators 

Bm- 

2. Choose a fixed basis {pi}^^i ... „2 for the density matrices. The pi correspond to 
physical states. 

3. For each i, obtain £{pi) in terms of pj in two different ways. 
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• Experimentally, prepare pi, apply £ and measure £{pi)- Express the outcome as 
£{Pi) = T.jPjKj- 

• Theoretically, express B^piBl = J2j i^ij"" Pj, so that £{p) = J2mnXmnBmpBl = 

Tlimn'^ij XmnPj- 

4. Comparing the experimental and theoretical results, and using the fact that {pj} is a 
basis, we obtain Xij = J2mn ^^ij^Xmn- Considering k as a matrix with double indices 
ij and mn, we have A = kx- Having found A experimentally, we invert k to obtain x- 

5. Note that x is positive by definition. Therefore, x = WDW'^ where W is unitary and 
D is positive and diagonal. Putting Xmn = Z^fc ^mkDkkWl^ into Eq. ( 2.28|) , we obtain 



an operator sum representation with operation elements — Zm ^^mk \^ -^kkBri 
Note that the resulting set of operation elements is canonical. 

Method II 



The second method follows almost immediately from the proof of (4) in Section 2.3. We 
retain all the notations defined in the proof of (4). The crucial observation is that Y and 
{I ^ £){Y) in the proof of (4) correspond to physical states \^){^\ and {I £){\^){^\) 
which can be prepared and measured. The procedure is to: 

1. Take two copies of TC and prepare the maximally entangled state \^). 

2. Subject one system to the action of £, while not doing anything to the other. 

3. Measure the combined output density matrix {I(g)£){\^){^\) = y^{l(Si£){Y), multiply 
by n and decompose as J2k \0'k){ak\- Divide \ak) (of length n^) into n equal segments. 

is the matrix having the i-th segment as its i-th column. 

This recipe is interesting because the mathematical proof that a quantum operation has an 
operator sum representation can be directly be translated to an experimental procedure for 
finding it. 



26 



CHAPTER 2. FUNDAMENTAL CONCEPTS 



Comparison 

Some interesting comparisons can be made on the two methods. Method I involves a x- 
representation as an intermediate step, which is unnecessary in Method II (the density 
matrix measured in Method II is directly related to the operation elements). Method I also 
involves a basis of physical states pi, which cannot be chosen to be orthonormal, and will 
complicate the analysis. In contrast, only one physical input state is required for Method 
II, which automatically contains the information for the unphysical orthonormal 

basis However, Method II requires the preparation of a maximally entangled state 

and the ability to isolate one of the two systems prepared. 

The two methods consume equivalent amount of resources, which is determined by 
the number of degrees of freedom in the quantum operation. In general, to measure an 
n X n density matrix, ensemble measurements are needed, requiring ~ Oiv?) steps. For 
method I, to determine n x n density matrices requires ~ O(n^) steps. For method II, 
to determine one x density matrix also requires ~ 0{n^) steps. In both cases, the 
number of steps is of the same order as the number of degrees of freedom in the quantum 
operation, therefore, both procedures are optimal in some sense. 



2.4 Common noise processes 

As an application of the quantum operation formalism, we describe some important noise 
processes using the operator sum representation. The traditional description of some of 



these processes using master equations is given in Appendix A.l. Noise processes are often 



called noisy channels, following the model communication problem: 



p 



£ 



Figure 2.7: A quantum channel 

We will consider primarily noise processes on a single qubit. In this case, the density matrix 
can always be written as 

p=hl + r-a) (2.29) 
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where r is a real vector in the unit ball and is often called the Bloch vector. In fact, the 
convention for single qubit rotation in Eq. ( |2.6D is chosen to correspond to the rotation of 
the Bloch vector. The effects of a quantum process £ can easily be visualized by comparing 
the Bloch vectors for p and £{p). 

Phase Damping 

Phase damping can be described by a channel in which a Z occurs with probability p G [0, 1]. 
The operator sum representation is given by 

£pDip) = il-p)p + pZpZ. (2.30) 

Consequently, 

{r^,ry,r,) ^ ((1 - 2p)r^, (1 - 2p)ry,r,) (2.31) 

resulting in the loss of coherence between different basis states. The maximum amount of 
decoherence occurs at p = 1/2. 



A |0) i |0) 




Figure 2.8: Trajectories of different points on the Bloch sphere under the effect of phase 
damping. Points move along perpendiculars to the z-axis at rates proportional to the 
distances to the z-axis. As a result, the Bloch sphere turns into an ellipsoid. 



The same process can arise from many different physical situations. For example, if the 
qubit is measured with probability 2p, 



£m{p) = {l-2p)p + 2p 



p|0)(0| + |l)(l|p|l)(l| 



(2.32) 
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which is just £pd{p) for < p < 1/2. Phase damping can arise from even more apparently 
different process. For example, if some random phase shift Pg = e~*^-^/^ occurs, the density 
matrix is evolved as 



a b 

b* c 



PepPl 



a be 

b*e'^ c 



(2.33) 



For a noise process in which is a random walk with independent and identically distributed 
steps, after a time t, the density matrix resulting from averaging over 9 is 



a be 
6*e-^* c 



-xt 



(2.34) 



for some constant A. 

The important implication is that, we may model a complicated real life process by 
much simpler ones without losing the essential physical content. We will see later how this 
fact is used in quantum error correction. 

Random Pauli channel 

In a random Pauli channel, an "error" X, Y or Z can happen each with probability p/3. 
The operator sum representation is given by 



£rp{p) = (1 - P)P + p/3 {XpX + YpY + ZpZ) 



(2.35) 



The random Pauli channel is also called the depolarizing channel, because £rp is same as 
swapping p with 1/2 with probability 4p/3! The result is a symmetric shrinking of the Block 
sphere towards the origin. 

In the generalized random Pauli channel, different errors can occur with different proba- 
bilities, in which case, the Block sphere still shrinks towards the origin, but asymmetrically. 
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Unital processes 



All the channels described so far fix the identity. These are examples of unital processes, 
defined by = I. Trace-preserving unital processes are the quantum analogs of the clas- 
sical doubly stochastic processes, and have many interesting properties [^]. For instance, 
the von-Neuman entropy of a state (defined to be Tr(/9log p)) can only increase under unital 
processes. Random unitary processes of the form 

£{p) = Y.PkUkpUl (2.36) 

k 

where Uk are unitary, are unital. The converse does not hold, except for qubit unital 



processes. An original proof is given in Appendix B.l. 

Non-unital processes play an important role in quantum information processing, because 
they provide the only means to reduce entropy and prepare initial states. 



Amplitude damping 

Amplitude damping is a process by which energy is lost to a zero temperature environment. 
It is defined by 

(2.37) 



£ad{p) = ^kpAl 

k=0,l 



Ao = 


1 


Ai = 


V7 




VI -7 








(2.38) 



where the energy eigenstates are chosen to be the computational basis states and |0) is 
the ground state. The energy exchange with an environment at finite temperature can be 
modeled by generalized amplitude damping^ defined as: 



£gad{p) = £ad{p) + V^-P X£ad{p)X . 



(2.39) 



where the parameter p is determined by the thermal energy ksT and the energy difference 

AE 

between the basis states, /\E, according to = c^bt _ Amplitude damping is the most 
important non-unital process known. 
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5 Summary 

• The fundamental unit of quantum information is a qubit: 
IV;) =a|0) oi p=lil + r- a). 

• The quantum circuit is a model of quantum information processing. 

• There are universal sets of quantum logic gates. 
Important universal sets include {e't'''^, cnot} and {H, P,cnot}. 

• There are three equivalent approaches to quantum operations: 

1. Subsystem evolution - £{p) = Tto U{p (g) Pa)U^ih <8) Po) 

2. Operator sum representation - £{p) = J^k^kP^k- 

3. Axiomatic - £^ is (i) linear (ii) trace non-increasing and (iii) completely positive 

• Quantum process tomography is a procedure to characterize an unknown quantum 
operation experimentally. 

• Common quantum channels include the phase damping channel, the Pauli channel, 
and the amplitude damping channel. Unital processes satisfy £{I) = I. 



Part II 

Quantum Error Correction 
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Chapter 3 

Theory of quantum error 
correction 



3.1 Introduction 

Quantum information processing is often described as a series of perfect unitary opera- 
tions and measurements on some physical system. Imperfections in the operations and 
interactions with the environment, resulting in gate and storage errors in the system, are 
inevitable in both classical and quantum information processing. Accumulation of errors 
will be detrimental in any large scale information processing. 

The problem of noise in classical computation and communication is resolved by sev- 
eral remarkable achievements: digitization of data, invention of reliable devices and fi- 
nally prudent use of redundancy (error correcting codes). At a first glance, generaliza- 
tion to the fragile quantum information is not just technically difficult but fundamen- 



tally impossible: It is impossible to clone arbitrary quantum information |112| perfectly. 
Moreover, quantum states and operators form continuous spaces, and it is impossible to 
detect infinitesimal errors. The much celebrated discovery of quantum error correcting 



codes 1 105, 105] came as a real surprise. The subsequent extension to achieve reliable com- 
putation with noisy components set a firm theoretical foundation for quantum information 
processing [0, ||, ||, |7|, 1^, ||, [lO^ . 
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The goal of this chapter is to cover the main ideas in noisy quantum coding, and to 
develop the language and background useful for the rest of the Dissertation. The develop- 
ment mentioned in this chapter also provides a context in which the original results in this 
Dissertation may fit. Original contributions from this Dissertation are presented from next 
chapter onwards. 

Elements in classical coding (taken from ||20| , |83| ) will be reviewed in Section |3.2| . The 
theory of quantum error correction and various code constructions is reviewed in Section |3.3|. 



Fault-tolerant quantum computation and the threshold theorem is reviewed in Section 3.4. 



3.2 Elements of classical Error Correction 
3.2.1 Fundamental concepts 

When information is stored or sent through an unreliable channel, it is possible to represent 
the information with redundancy to improve the probability of recovery. The set of all 
possible encoded messages (codewords) form an "error correcting code". 

We illustrate the basic concepts with a simple example. We consider the binary sym- 
metric channel, in which a bit b is flipped to b with probability p. This channel is symmetric 
with respect to 6 = and 1. 







1 








1 



Figure 3.1: The binary symmetric channel 

Errors in different uses of the channel are independent. Suppose p is small, so that multiple 
errors are unlikely. One can send b three times and majority-decode to obtain the output. 
Then, the output is correct unless at least two errors occur, which happens with probability 
p' = 3p^(l —p) +p^, which is less than p (an improvement) lip < 0.5. This scheme is called 
the 3-bit repetition code. It illustrates some general concepts in coding: 
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1. First, one must assume certain noise characteristics of the channel. For example, the 
binary symmetric channel with independent errors is used above. 

2. Given a channel, codewords are chosen from a space larger than required for single 
use, so that the likely errors take the original codewords to disjoint sets of received 
messages to ensure correct decoding. This is best illustrated for the 3-bit repetition 
code pictorially: 



001 



000 > 



on 



010' 



'/lOly 



100\ 



'ill 



110. 



Figure 3.2: The geometry of the 3-bit repetition code. The likely errors take the codewords 
000 and 111 to disjoint sets of messages defined by the dashed lines. 

3. Coding reduces the error probability at the expense of more uses of the channel. If 
M codewords are encoded in n bits, the rate of the code is (log2 M)/n. For example, 
the 3-bit repetition code has rate 1/3. 

Most of the time, we focus on independent errors with small probability p. Coding is 
designed to handle up to a certain number of errors. For example, the 3-bit repetition code 
is "1-error correcting" . Similarly, a t-error correcting code can correct t errors and it takes 
at least t + 1 errors to cause an overall failure. The overall probability of error is improved 
from p to 0{p*'^^). Coding entails a trade off between the rate and the rehability of the 
transmission or storage process, which is captured in a few parameters defined in the next 
section. 

3.2.2 The geometry of error correction 

Let the symbol set be a field F,\F\= q, and let messages of length n be vectors in F". We 
consider an arbitrary field for most of the time, and restrict to the binary field occasionally 
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to simplify notations. 

Definition 4- The Hamming distance between two vectors a and b, dH{cL,b), is 
the number of coordinates in which they differ. 

Note that the Hamming distance is a metric. We call the set {b : dnia, b) < r} the Hamming 
sphere of radius r centered at a. 

Definition 5: The distance of a code C is the minimum Hamming distance 
between any two codewords, d = min{(i//(a, b) : a b,a,b £ C}. 

Suppose no more than r errors occur. Each codeword is taken to a point within its own 
Hamming sphere of radius r. These Hamming spheres do not overlap if 2r < d, therefore 
the codeword can be decoded correctly, leading to Theorem ^. 




Figure 3.3: Hamming spheres of radius r for 2r < d. 
Theorem 2: A distance d code is t-error correcting where t = [^^J. 

Definition 6: An {n,M,d) code is a set of M vectors (codewords) in with 
distance d. n is called the block size of the code. 

Viewing an (n, M, d) code as a set of points in F" , the problem of coding is to pack as many 
points as possible while maintaining a certain distance between the points. 

The Hamming Bound is an important bound relating the distance and the rate of a 
code. For a message of length n, a Hamming sphere of radius r has V{r) = X]i=o (^) il ~ 
elements. For a t-error correcting code, Hamming spheres of radius t around the codewords 
do not overlap. Therefore, 



3.2. ELEMENTS OF CLASSICAL ERROR CORRECTION 



37 



Lemma 1 : (Hamming bound) Let C be an (n, M, d) code, then 

ri-log,M>log,y(L^J) (3.1) 

Codes which saturate the Hamming bound are called perfect. 

Error correcting codes can be used for error detection. The decoder will check if the received 
message is a valid codeword. For an (n, M, d) code, it takes d errors to avoid detection. 
Hence, an (n, M, d) code can detect d—1 errors. In general, an (n, M, d) code can correct 
t' errors and detect up to i + 1' errors for 2t + t'< d. 



We will describe a special class of codes, the linear codes, with many simplifying prop- 
erties. There is a beautiful relation between classical linear codes and a class of quantum 
codes. 

3.2.3 Classical linear codes 

Any error-correcting code C is a subset of F", the n-dimensional vector space over F. C is 
called linear if it is a subspace of F". In other words, 

1. C 7^ ^ (the empty set) (3.2) 

2. a,beC, a,l3eF ^ aa + pbeC (3.3) 

If dim(C) = fc, C is an {n,q'',d) code, or an [n,k,d] code ([•,-,•] is used for linear codes 
only). When q = 2, k bits are encoded in n bits. 

Definition 7 : The weight of a vector a is the number of non-zero components. 
It is denoted by WH{a) and it equals dH{a,0) (0 denotes the zero vector). 
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Theorem 3: The distance of a linear code C is equal to its minimum weight, the 
minimum of the weights of its non-zero codewords. 

A linear error correcting code can be defined by its generator matrix or parity check matrix. 
Let C be an [n, /c, d\ code. A k x n matrix G is a generator matrix of C if the rows of G 
form a basis for C (codewords are taken as row vectors). An original message u is encoded 
as V where v = uG. An (n — k) x n matrix H is a, parity check matrix for C iff, for any 
c G C, cH^ = 0. The parity check matrix represents the constraints that define the valid 
codewords. Alternatively, the rows of H form a basis for the orthogonal complement of C. 
Note that GH'^ = 0. 

The dual code of C, denoted by C"*-, is the orthogonal complement of C in F". is 
an [n,n — k,d-^] code with generator matrix and parity check matrix G^ . In general, 
there is no simple relation between d and d^. Counter- intuitively, a binary vector is self- 
orthogonal if it has even weight, and it is possible for C and C"*" to intersect. If a code is 
self-dual, d = d^ . If a code contains its dual, d"*" > d. 

The Hamming Bound for a linear [n, code is given by 

n-A;>log,y(L^J) (3.4) 

Another useful concept is that of a "syndrome" . Let C be an [n. A;, d\ code with parity 
check matrix H. Let v be the encoded message sent and r be the possibly corrupted message 
received. The syndrome of r is defined to be s = rH^ (an (n — A;)-vector). If e is the error, 
r = V ® e and 

s = rH^ = {vH^) e (eH^) = eH^ (3.5) 

Hence, the syndrome only depends on e but not on v. There is a 1-1 correspondence 
between errors of weight < {d — l)/2 and syndromes, providing a means for maximum 
likelihood decoding. 

A coset of a code C is the set of elements w(BC for w G F". An error e takes the original 
codeword space to the coset e © C Two cosets wi (BC and W2 (B C are equal iff wi,W2 are 
in the same coset. The cosets form a partition of F"'. In an [n,k,d] code, there is a 1-1 
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correspondence between syndromes and cosets. The element of minimum weight in each 
coset (a unique element of weight < (d — l)/2) corresponds to the most likely error occurred 
when the coset is identified. This is best captured by the standard array of C, which is a 
qfi-k ^ qk jjiatrix having the following format: 





HS{0) 


HS{ci) 


i/S(c2) • 




-l) 


c 







Cl 


62 






ei ® 


C 


ei 


61 © Cl 


61 © 62 • 


■ 6i © Cyfc 


-1 




C 




62 © 61 


62 © 62 • 


• 62 © Cqk 


-1 


ei e 


C 




6/ © Cl 


6/ © C2 • 


6/ © Cgfc_ 


-1 



Table 3.1: The standard array for an [n, fc, d\ code. I = ^ — 1 in the above. 

In the standard array, the rows are the cosets. Each error acts like a mapping from 
C to ei® C, and the z-th row is the image of C under e^. The j-th column is like the 
Hamming sphere around the codeword Cj (here the elements in a column may not form 
an exact sphere). The first column in each row is the most likely error conditioned on the 
corresponding syndrome. When r is received, maximum likelihood decoding can be done 
by reading out the column index. Alternatively, the syndrome can be obtained to find the 
row index of r and the most likely error e. The message sent, f , can be recovered by adding 
e to r, without knowing v. For concreteness, the standard array and the correspondence to 
the sydrome for a [5, 2, 3] binary code is as follows: 

The interpretation of the standard array and the second decoding method that requires no 
knowledge of v will become important in quantum coding, as discussed next. 

3.3 Quantum error correction 

As discussed earlier in this chapter, generalizing classical error correcting codes to the 
quantum case appears impossible because of the no-cloning theorem, the impossibility to 
measure and verify quantum states without disturbing them, and the difficulty in handling 



40 



CHAPTER 3. THEORY OF QUANTUM ERROR CORRECTION 



message 


00 


01 


10 


11 


syndrome 


code 


00000 


OHIO 


10011 


11101 


000 


coset 


10000 


11110 


00011 


01101 


Oil 


coset 


01000 


00110 


11011 


10101 


110 


coset 


00100 


01010 


10111 


11001 


100 


coset 


00010 


01100 


10001 


11111 


010 


coset 


00001 


01111 


10010 


11100 


001 


coset 


11000 


10110 


01011 


00101 


101 


coset 


10100 


11010 


00111 


01001 


111 



Table 3.2: The standard array for a [5, 2, 3] code, showing the 1-1 correspondence between 
the most hkely errors, the corresponding cosets and syndromes. 

a continuum of possible errors. In this section, we will describe the ingenious methods 
to adapt classical coding theories to construct quantum codes, despite all the apparent 
difficulties. We assume perfect logic operations, or equivalently the codes are designed to 
protect against storage or transmission errors but not operational errors. The latter will be 



discussed in Section 3.4. 



Out of the many beautiful results in quantum error correction, this section will describe 



the first quantum code due to Shor |105], an elegant necessary and sufficient criteria for 
quantum error correction due to Nielsen et al |^^, the CSS codes derived from classical 
linear codes due to Calderbank, Shor, and Steane \ 



1051, and the stabilizer formalism of 



quantum error correction due to Gottesman 56]. 



3.3.1 The Shor code 

How is quantum error correction possible? The answers are best illustrated by Shor's 9- 



qubit quantum code [105|, which is the quantum analog of the classical 3-bit repetition 
code. 

Suppose we encode the states as follows: 



|0) ^ |0l) = (|000) + |111))(|000) + |111))(|000) + |111)) 
|1) ^ lu) = (|000) - |111))(|000) - |111))(|000) - |111)) 



(3.6) 



where the overall normalization is omitted. The subscript L denotes the encoded or logical 
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states (see Section 2J.). An original superposition a\0) + 6|1) is encoded as a\OL) + b\^L), 
delocalizing the encoded information among the 9 qubits. 

This quantum code wih correct for any one-qubit error for the following reasons. We 
first show that it is capable of correcting X errors and Z errors on any one qubit. Without 
loss of generality, suppose the first qubit has an X error. This error can be revealed 
unambiguously when comparing the pairs of qubits (1,2) and (2,3). In fact, comparing 
the i-th and j-th qubits for G {(1, 2), (2, 3), (4, 5), (5, 6), (7, 8), (8, 9)} can reveal any 

single X error. Similarly, a single Z error can be found by comparing the signs in the first 
and second blocks, and in the second and third blocks. The Shor code can also recover 
a combined X and Z error on the same qubit, by performing both procedures, which are 
independent. Finally, a qubit suffering an arbitrary error is projected to having /, X, Z, 
or XZ errors during the "syndrome measurements" of the X and Z errors, and can be 
corrected. 

Many remarkable concepts are illustrated in this code. We provide an informal discussion 
in the rest of this section, followed by an abstract general framework for quantum error 



correction in Section 3.3.2. 



The syndrome measurements do not require measuring the qubits. For example, see 
Fig. 

Even more important is that, the syndrome can be found without gaining any in- 
formation on the encoded states. The unitary errors {X and Z) can be inverted 
without knowing the encoded information. This is analogous to the classical decoding 
method by projecting onto a coset in the standard array and inverting the most likely 
(minimum weight) error. 

Redundancy is used to embed the codeword space in a larger ambient space, so that 
correctable errors take the code space to orthogonal subspaces (the quantum analog 
of the Hamming spheres) which do not overlap. At any time, there is at most one 
copy of the state. 

Z errors acting on different qubits within the same block have identical effects. It 
is neither possible nor necessary to distinguish between these errors (interpreting the 
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|o) CD0 m 

|0) 




Figure 3.4: Syndrome measurement for the Shor code. The left circuit determines if an X 
error has occurred to any qubit in a block. The right circuit determines if an Z error has 
occurred in one of the three blocks. 



error as any one of the possibilities will do). A quantum code is nondegenerate if all 
the correctable errors can be identified unambiguously; otherwise, it is degenerate. 
The Shor code is a degenerate code. A more rigorous definition of degeneracy is given 
in the next section. 

The Shor code can correct for an arbitrary error that occurs to a single qubit. This 
is seemingly impossible, since the syndrome space is finite but the possible number 
of errors are infinite. The crucial observation is that, the continuum of errors can 
be discretized. To see this, suppose an arbitrary error E has occurred to a qubit. 
Since E can always be written as Cj/ + CxX + CyY + CzZ, the erroneous state is the 
superposition E\ipin) = CjlV'm) + c^X|^i„) + CyY\7pin) + CzZ\tpin). The syndrome 
measurements, which identify the I, X, Y and Z errors, project the erroneous state 
onto one of the four terms which are correctable. Non-unitary errors are correctable 
for the same reason. Finally, a general noise process can still be discretized using the 



operator sum representation introduced in Section 2.S. A more rigorous treatment 



will be given in Section 3.3.2. 
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• The Shor code is a concatenated code - it concatenates two 3-bit repetition codes, 
correcting the X and the Z errors at the lower and higher level codes respectively. 

Having witnessed how the many difficulties in quantum coding can be circumvent in the 
Shor code, we proceed to discuss the general theory of quantum error correction. 

3.3.2 General theory of quantum error correction 

In this section, we present algebraic conditions for a subspace C < to be a quantum code 



that corrects for certain errors acting on 7i. These conditions [49, 17, 89 1, often called 
the "criteria for quantum error correction", are of tremendous value in understanding and 
constructing quantum codes. They also allow notions in quantum error correction to be 



made rigorous. We take the approach using reversible operations |8S] 



Definition 8 : Suppose £ \s a, quantum operation. We say that £ is reversible on 
a space S if there exists a complete Q quantum operation IZ such that, \/p ^ S 

no£{p)=TT{£{p))p. (3.7) 

We require TZ to be complete because the reversal should be deterministic. It is important 
to consider possibly incomplete £. When Eq. (|3.7D is satisfied, Tv{£{p)) = p cannot depend 



on p because the left hand side of Eq. (3/7) is linear. 

We may connect the notion of reversibility with quantum error correction. Specifically, 
consider a complete quantum operation £ acting on 7i. For example £ can be independent 
error processes acting on individual qubits. For a general process £, a code C may not 
exist on which £ is exactly reversible. We interpret £ as a, stochastic process, with operator 
sum representation £{p) = J^neK. ^nP^n- Here, /C is the index set of A, the set of all 
operation elements An appearing in the sum. These An have natural interpretation as 
"errors", occurring with probabilities Tv{AnpAl^). Intuitively, a quantum code C should 
correct for the likely errors. 

To make these notions rigorous, we denote by Are C A the reversible subset on C, and 

let ICre = {'^l An £ Are} be the index set of Are- The reversible subset is defined so that 
^ Complete quantum operations are trace-preserving. 
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the possibly incomplete process £'{p) = J^neJCre ^^nP-^n is reversible on C in the sense given 
by Def. |8|. We say that the code C can correct for £' or the "errors" An S Are- The error 
correction criteria, expressed as algebraic conditions for reversibility, are as follow: 

Theorem 4 ■ Are is a reversible subset iff 

PcAlnAnPc = gmnPc ^rn, n € /C^e , (3.8) 
where Pc is the projector onto C, and gmn are the entries of a positive matrix. 
Remark: Eq. (|3.8| ) is often written as 

{ci\Al^An\cj) = SijOmn Vm, n G /C^-e Vi,j (3.9) 
where |cj) are logical states for the code. 

Proof: [Necessity] Suppose £' is reversible on C. Then, there is a complete 
operation 7^(-) = J2i P-l ' Pj and a constant p such that 

no£'{p)=pp Mp^C. (3.10) 

For all /9 G PcpPc £ C, Therefore, 

no£'{PcpPc) = Y. E RiAr^PcpPcAiR\ = pPcpPc (3.11) 

The last equality is between quantum operations, therefore, RiAnPc and ^/pPc 
are related by Theorem |l] in Section 2.3: 



iizA^Pc = VinVJ^Pc (3.12) 

where X^in l^'ZnP = 1- Using J^iPjPi = (since 7^ is complete) and Eq. ( |3.12| ), 
we can write 

PcAlAnPc = E PcAlnjRlAnPc = pY. PcvtmVlnPc = QmnPc (3.13) 
I I 

where Qmn = pJ2i '^im'^in are entries of the positive matrix g = pv^v. 
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[Sufficiency] Suppose Eq. ( |3.8D is true with g being a positive matrix. First 
of all, we can always change to another operator sum representation for £' 
to make the corresponding g diagonal: Since g is positive, there is some uni- 
tary matrix u, and positive diagonal matrix p such that u^gu = p. Therefore, 
T,mn'^km9mnUni = Pkhi where pk = Pkk- Together with Eq. (|3^), we obtain: 



Y,ul^{PcAlAnPc)uni = PkSkiPc (3.14) 

mn 

(PcAlAiPc) = PkSkiPc (3.15) 

where Ai = J2n'^nUni- Moreover, by Theorem {Ai} and {An} generate 
the same quantum operation £' . The desired new operator sum representation 
£'{p) = J2i AipA\ is canonical. 

We now construct a recovery operation TZ to demonstrate reversibility of £' on 
C. Using the diagonal elements of Eq. ( ^.15 ), the A^ have polar decompositions 



AnPc = VKUnPc Vn G ICre , (3.16) 

where the Un are unitary. Using the off-diagonal elements of Eq. ( 3.15| ) 



PcUlUmPc = 5nmPc ■ (3.17) 



Equations ( p. 16 ) and ( p. 17 ) correspond to the well known non- deformation and 



orthogonality conditions for quantum error correction. Equation ( 3.16| ) means 
that C should be chosen so that the correctable errors act like unitary processes 
on C. Equation (|3T^ ) says that the correctable errors should map C to or- 
thogonal subspaces. Using the projectors UmPcU}n onto these subspaces, the 
corresponding errors Am can be identified, and be reversed by U}^- 

Mathematically, the recovery operation TZ is given by: 



7^(p)= RkpRl + PEpPE^ 



(3.18) 
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where Rk = PcUl is the appropriate reversal process for each error Ak {k G /Cre), 
and Pe = 
any p £ C 



and Pe = I — J2k€JCre ^kPc^l is required for completeness. We check that, for 



£'{p) = J2 ^iPcpPcA] (3.19) 

= E PiUiPcpPcU} (3.20) 

no£'{p) = PiPcUlUiPcpPcUlUkPc (3.21) 

= E PIP (3-22) 

= T^{£'{P))P (3.23) 



where we have used Eq. (|3l7D to derive Eq. { ^2^ and Eq. ( ^^201) to obtain 



Tr(<S'(p)) = J2i£iCrePi- ■ This concludes the proof of sufficiency and the theorem 
itself. 

The code is non-degenerate iff Rank((5r) = \JCre\- Therefore, a code is degenerate iff p„ = 
for some n. The present definition contrasts with the more commonly used definition that 
the code is non-degenerate iff g is diagonal, which is not satisfactory since it is not invariant 
under different choices of the operator sum representation. The present definition captures 
the idea that a code is degenerate if Pc-A-n are linearly dependent, and that not all errors 
An are relevant on C. 

Note from the proof of the theorem that, if a code C corrects for the errors A^ G A^e, it 
corrects for any noise process with operation elements which are linear combination of the 
AnS. For this reason, a quantum code that corrects for X, Y, and Z restricted to t qubits 
can correct for any error on t qubits. 

We now define the quantum analog of the probability of successful recovery. We need 
to quantify how close is one state from another. For a pure input state {ip) and arbitrary 
output state p, we define the overlap fidelity between and p to be 



F(|V'),p)=Tr(|V')(V|/o) = (^IHV') 



(3.24) 
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which is the probabihty of projecting p onto the span of {ip). The fidehty for a channel 
£ quantifies how well an arbitrary input state is being preserved. The minimum overlap 
fidelity for £ is defined as 

J^£ = mmF{\^),£{\^l^){m (3.25) 

W 

which is the worse case input-output overlap fidelity. 

For a given noise process £ and a code C, we can quantify the effectiveness of C using 
the improved fidelity !FiioS- ^TloS is lower bounded by P'^^* = J^neJCreP"^^ which is the 
total detection probability for the reversible subset. This is because 

TnoS = min[F{m,no£'{\i;){i;\))+Fi\i;),no£''{\i;){i;\))] (3.26) 
> rninF(|V^),7^o^:'(|V>)(V|)) (3.27) 
= P'^"* (3.28) 

where £"{p) = J2ic-lCre -^nP-^n denotes the complementary process which cannot be re- 
versed. To achieve a desired fidelity T^., we have to include in Are a sufficient number of 
highly probable errors An so that T-jioe > P'^''^ > ^d- Thus, A re is also a high probability 
subset. 

Having covered the general theory of quantum error correction, we turn to the issue of 
finding codes. 

3.3.3 Stabilizer codes 

The general theory of quantum error correction described in the previous section has pro- 
vided criteria and a recipe for error recovery. However, it does not provide construction 
methods. Just like classical coding, code construction relies largely on the ingenuity of their 
inventors. 

We have seen that the Shor code is related to the classical repetition code. The classical 

and quantum codes also use redundancy in a related manner. A useful connection to make 

^ The enlarged Hilbert space Ti. is different from the unencoded space. The actual improvement in fidelity 
should be the difference T-naS — Te^ where Eo acts on the unencoded space. 
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is to adapt known classical codes to construct quantum codes. Steane |108|, and indepen 



dently Shor and Calderbank developed a class of codes (named CSS codes after their 
inventors) derived from some special linear classical codes over GF(2). Gottesman [55, ^ 



subsequently developed a group theoretical description of quantum codes which is very use- 
ful in constructing and understanding quantum codes. Similar but more general results by 



Calderbank et al |l23| , 24 1 connected quantum coding to orthogonal geometry and classical 
codes over GF(4) and led to the discovery of many codes with important implications. We 
review the stabilizer code formalism due to Gottesman which is more relevant to this Dis- 
sertation. It also provides simple explanations of the CSS codes and fault tolerant quantum 
computation. 

Stabilizer codes 

Consider the n-qubit Hilbert space 7i of 2" dimensions, and the "smaller" Pauli group Qn 
acting on generated by X^''^ and Z^^^ (see Section |2 . 2|) . In this definition, (7y ^ Qn-: 



though Y = XZ = —icTy G Qn- Each element M £ Qn has eigenvalues ±1 or iti, and 
= ±1 (Mt = ±M). Any two elements in Qn either commute or anticommute. Let S be 

an ahelian subgroup of Qn, with n — k hermitian generators. These commuting generators 

define 2""*^ simultaneous eigenspaces. The eigenspace C corresponding to the eigenvalue 
for all generators of S is called the stabilizer code with stabilizer S. ^ Any state \ip) in 

C is stabilized by any element M in S, i.e. Ml-i/;) = C has 2^ dimensions and therefore 

encodes k qubits. 

Let {I^l)} be a basis for C and {Ea} be the set of errors to be corrected. For now, we 

concentrate on t-error correcting codes, therefore {Ea} C Qn contains elements in Qn with 

Specifically, consider a linear classical code Ci with subcode C2 and consider the cosets of C2 in Ci . We 
fix some computation basis, and call its Hadamard transform the conjugate basis. We define a quantum 
code with logical states as follows. Each logical state is an equal superposition of codewords in a coset of C2 
in Ci in the computation basis. It can be proved that each logical state is some superposition of codewords 
in C2 in the conjugate basis. Hence the quantum code can independently correct for X errors using Ci in 
the computation basis and X errors in the conjugate basis due to C2 . But the X errors in the conjugate 
basis are just Z errors in the original computation basis. When Ci and C2 are both good codes, a certain 
number of X and Z errors can be corrected, implying that arbitrary errors in certain number of qubits can 
be corrected. 

* The existence of such an eigenspace is guaranteed by the commutivity and hermiticity of its generators. 
Note that every element M £ S is hermitian with eigenvalues ±1, and = I. In doing so, we exclude 
elements of odd number of K's from S. 
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Hamming weight no greater than t. 

Theorem 5: C is a code correcting {Ea} if Va,6, E\Ei, either (i) anticommutes 

with some M G S" or (ii) EIE^ G S. 

Proof: If Ej^Eb anticommutes with some M & S, 

{il\EIE,\jl) = {zl\EIE,M\jl) = -{tL\MEiE,\jL) = -{ilIeIE.Ijl) =0. (3.29) 
If EiEh G S, 

{iL\ElE,\jL) = {^L\jL) = S^,. (3.30) 

Moreover, if ElEb G S, (EIEi,)-^ = ElEa G S, hence Eq. ( ^!30|) holds when a 
and 6 are interchanged. Combining Eqs. ( 3.2g| ) and ( |3.30D , {iL\ElEi,\jL) = ^ijCab, 



where Cab = or 1 is independent of i, j, and since Caa = !> Cab form a positive 
matrix. Therefore C satisfies the error correction criteria given by Eq. (p.9|). 

A stabihzer code is non-degenerate if, Va ^ b, E^E^ anticommutes with some M ^ S. In 
this case, Cah is diagonal. A degenerate stabilizer code has some correctable errors Ea, E^ 
such that Ej^Ef) G S, meaning that Ea, Ef, act identically on C. Therefore, for a stabilizer 
code, correctable errors are either exactly distinguishable or identical on C. 

The distance of a code d is the minimum weight E £ N{S) — S. It follows that a 
distance d code can correct for t errors, with d > 2t + 1. The notation [[n, /c, d]] is used for 
a quantum code with distance d that encodes k qubits in n qubits. The double brackets 
distinguish quantum linear codes from classical linear codes. 
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We revisit the Shor code, which is a [[9, 1, 3]] stabihzer code with 8 generators: 
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(3.31) 



In Eq. ( |33ll ), rows correspond to generators, and columns correspond to qubits. For exam- 
ple, the first row corresponds to the generator Z®Z®I®I®I®I®I®I®I. The fact 
that the Shor code can correct all single qubit errors can be routinely checked on all Pauli 
operators with weight no greater than 2. The code is degenerate as all the Z generators 
are of weight 2. It means some Z errors are not distinguishable from another, as can be 
verified from Eq. ( |3.6D . The syndrome measurments of comparing qubits or the signs of 
blocks correspond to measuring the eigenvalues of Mj. 



The generator matrix of a stabilizer code (such as Eq. ( 3.31 ), with rows given by the 
generators) plays a role similar to the parity check matrix in classical coding. In the 
stabilizer code, one measures the eigenvalue (±1) of each generator on the possibly corrupted 
received state, which is analogous to a parity check. The measured eigenvalues form the 
"syndrome" in the quantum code. As each distinguishable error takes C to a different 
simultaneous eigenspace of the generators of S corresponding to some different eigenvalues, 
each syndrome corresponds to a unique error in {Ea}- 

Note that the stabilizer formalism gives a very concise description of the code and the 
effects of the errors, because it describes operators rather than states. This "Heisenberg" 
point of view will be even more useful when we discuss fault-tolerant quantum operations. 

The Shor code is an example of a CSS code. In the stabilizer language, the CSS codes 
are stabilizer codes such that each generator can be chosen to be a tensor product of / and 
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either X or Z, but not both. The X generators correspond to the parity check matrix of 
the classical code C2 which handles the Z errors, and the Z generators correspond to the 
parity check matrix of Ci which handles the X errors (using the notation defined in the 
footnote at the beginning of Section 3.3.3| ) . 



A particularly interesting CSS code is the [[7, 1,3]] Steane code | 108 | with stabilizer: 
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(3.32) 



The code is symmetric with respect to interchanging the X and Z generators (C2 = C^, 
hence Ci has to contain its dual.) This code has very attractive fault-tolerant features to 



be discussed in Section 3.4 



The last example of stabilizer code is the smallest possible code which can encode one 



qubit and correct for a single qubit error ||17|, |76[. This [[5,1,3]] code is cyclic, and it 
saturates the quantum Hamming bound. 
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(3.33) 



It is interesting that no CSS code can encode one qubit in five and correct for any single 
qubit error. R 



^ This can be shown by ehmination. Suppose such a CSS code with 4 generators exists. The only 
possibihty is having two X generators. They cannot both be / in any coordinate. Therefore, one generator 
is of weight > 3. Reordering the qubits, the first generator is of the form XXX ■ ■ . Now, any choice of the 
second generator commutes with some Pauh operator with two Z in the first 3 coordinates, completing the 
proof. 
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3.4 Fault-tolerant quantum computation and the threshold 
theorem 

The quantum error correcting codes discussed so far can protect quantum information from 
storage or transmission errors, but not the errors due to the logic gates. Such errors are 
our main concern in actual computation. In particular, we have the following questions to 
address: 

1. Quantum states and gates form a continuous space. Inaccuracies in the logic gates 
will accumulate over a long calculation. 

2. It is necessary to perform operations without introducing and spreading errors in a 
catastrophic manner. 

3. The encoding and recovery procedures are complex and not error free. A net reduction 
of errors is not guaranteed by the existence of quantum codes. 

4. Quantum codes can only reduce the error probability. We may still need an infinitesi- 
mal physical elementary error rate to achieve arbitrarily reliable and long computation. 

5. Finally, even if the feasibility issues are resolved, it is not obvious what are the extra 
resources required to make the computation reliable. 



The above issues were first addressed by Shor [104] and the results were substantially 
extended and improved by many others [H, |ll|, ^ 74, 38, 57, 21, 59, 115]. The essence of 
these remarkable results will be summarized in the rest of this section. 

The discussion is based on the the circuit model of quantum computation, and assumes 
that classical computation is perfect and fast, and that measurements can be performed 
during the computation. We assume independent noise processes on different qubits, and 
the error rates do not grow rapidly with the system size. 

3.4.1 Fault-tolerant quantum computation 

A device that works effectively even when its elementary components are imperfect is said 
to be fault-tolerant. One can identify some basic rules to ensure that the computation is 
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fault-tolerant: 

1. Encode the states ~ Protect quantum information using quantum error correcting 
codes 

2. Encode the operations - Compute with encoded qubits without decoding them. 

3. Use fault-tolerant operations - Use operations such that a single error anywhere can 
only produce at most one error in any other encoded block. In particular, transversal 
operations which only interact a qubit with the corresponding qubit in another code 
block or ancilla are fault-tolerant. 

4. Discretize elementary operations - Use a discrete universal set of gates. 

5. Eliminate potential errors by verification and repetition of measurements. 
We devote the rest of this section to discuss some of these elements. 



Discretization 

The problem with a continuous set of logic operations is resolved by using a discrete universal 
set of gates. The discrete set can be the Clifford group with an additional gate such as the 



Toffoli gate or vr/S gate (see Section |2.2D 



The goal is to find quantum codes and corresponding sets of operations which are uni- 
versal and fault-tolerant. When constructing encoded gates, we assume that the usual logic 
operations (for example, one qubit gates and cnots) can be performed on the physical 
qubits. The gate errors will become errors in the encoded states which can be detected and 
corrected in the usual manner. We focus on fault-tolerant gates on stabilizer codes, and in 
particular the CSS codes. 

Encoded Pauli group in stabilizer codes 

Recall that the stabilizer S is an abelian subgroup of Gn ■ Operators which do not commute 
with all M E 5 take C to its orthogonal complement and are detectable errors. However, 
there may be operations in Gn that commute with every M G S but are not in S. They are 
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the "dangerous" errors that change the encoded qubits without being detected. However, 
these are exactly the legitimate encoded operations on the stabilizer code because they 
evolve codewords to codewords. Mathematically, the centralizer of S consists of operations 
which commute with every M ^ S. For a stabilizer, the centralizer is just the normalizer of 
S in Q, denoted by N{S), which is the set of all operations which permute the elements of 
S by conjugation. f\ Since M G S acts trivially on C, the encoded operations can be taken 
as elements in N{S)/S. 

Consider an [[n,A;,(i]] stabilizer code. S has n — k generators (and 2""'^ elements). 
The generator can be extended to a maximal independent set of n mutually commuting 
observables (generating 2" commuting observables) by k extra elements Zi , Z2 , • • • , in 
N{S) / S. Then, the simultaneous eigenstates of Zi in C can be chosen as the encoded logical 
states. In particular, the encoded |ci, C2, c^) is the (—1)'^' eigenstate of Zi. This turns 
the Zi into the encoded cTz for the i-th logical qubit. The remaining elements in N[S)/S 
will not commute with all of Zj, and each Xi can chosen from N[S)/S which commutes 
with all Zj for j ^ i and anticommutes with Z^. Xi acts as the encoded Gx for the i-ih. 
logical qubit. 

Encoded Clifford group operations 



Stabilizer formalism and Heisenberg representation |5^, 58, ^ The stabilizer for- 
malism describes the transformation of the operators instead of the states. In general, if a 
state \tp) is initially stabilized by M, 

UMU^U\^p) = U\^p) , (3.34) 

and U\ijj) is stabilized by UMU^ . In other words, the stabilizer M is transformed to UMU^ 
when the state is transformed by U. 

We can use Eq. ( |3.34 ) to find encoded operations. Suppose we want to find an implementa- 



tion of the encoded operation U in terms of operations on physical qubits. Any operator W 



such that WX,W^ = UXiW and WZiW^ = UZiW is a vahd implementation of U. We will 

^ Let Ae g„ and M e S. AM A'' = G 5 if [M, A] = 0. AM A'' = -M ^ 5 if {M, A} = 0. Therefore, 
[A, M] € S VM G 5 iff AM A'' G S VM G S, meaning that A permutes the elements of S. 
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use this method to find the encoded CHfford group operations. We consider the action of 
one operator on another by conjugation. (Recall Oi takes O2 to O3 means O1O2OI = O3.) 

Recall from Section ^l2| that the Clifford group, N{Q) can be generated by the Hadamard 
gate H, the phase gate P = \fZ = Diag(l, i), and cnot. H interchanges X and Z, P takes 
X to iY and Y to iX, and CNOT12 takes XI to XX and IZ to ZZ and preserves IX and 
ZI. The encoded Clifford group elements are defined similarly: H interchanges X and Z, 
P takes X to iY and Y to iX and so on. 



Self-dual CSS codes We restrict attention to a class of CSS codes which admit simple 
encoded Clifford group gates. Let C be an [n, d] classical code. Let the generator and 
parity check matrices be G and H respectively. [] Suppose C is self-dual. Then, G = H. We 
have seen that G has k rows while H has n — k rows. Hence, n = 2k and C is a [2A;, k, d] 
classical code. We represent C by listing its codewords as rows in a matrix C. We order the 
rows as: 










Mo 







1 






Ml 


1 





(3.35) 



The [2k — l,k, d'] punctured code (with d — 1 < d' < d) is defined by 



Mn 



Cn 



(3.36) 



Ml 



Let the parity check matrix for the punctured code be Hp. We claim that Hp generates 

^The symbol H is used for both the Hadamard gate and the parity check matrix. The usage should be 
clear from the context. 
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Mq. Let Vp € Mq. From Eq. (3.35), Vp is orthogonal to every row in Mq or Mi since C is 



self-dual. Hence, Vp can be generated by Hp. Conversely, if Vp can be generated by Hp, it 
is orthogonal to every row in Cp. Define the vector v to be that with prepended to Vp. 
V is orthogonal to all rows in C and is therefore in C. Since the first coordinate of f is 0, 
Vp £ Mq. Hence Hp generates Mq. 

Note that Mq is closed as a subspace (since C is linear) and Mi is a coset of Mq. Con- 
structing a quantum CSS code using cosets of Mq in Cp, we get a [[2k — l,l,d — 1]] quantum 
code, with both X and Z matrices in the generator of the stabilizer equal to Hp. 

Encoded Clifford group elements in CSS codes These CSS codes have remarkably 
simple encoded operations. First of all, the X and Z matrices of the generator are both 
Hp C Mq. As Mq is self-dual, every generator has even weight. Therefore, X and Z 
can simply be chosen to be the bitwise |^ X and Z which commute with the stabilizer 
and anticommute with each other. H which interchanges X and Z can be chosen to be 
the bitwise H. A cnot between two code blocks is just the bitwise cnot between the 
corresponding qubits in the code blocks. If the classical code C is chosen to be doubly even 
(i.e. weight of every word is a multiple of 4), the weight of every generator in the stabilizer 
is a multiple of 4, and bitwise P and are in N{S). P is bitwise P or P'^ depending on 
whether k is odd or even. Bitwise operations are automatically transversal. 

The Clifford group elements can be performed on a general stabilizer code, though they 



are more complicated. These can be found in [|5^, p7| . 



Fault tolerant syndrome measurements 

So far, we have discussed unitary operations only. We now consider measurements, which are 
essential in error recovery and preparation of standard states. We consider measurements 
of bitwise operators which have eigenvalues ±1. These include (but are not restricted to) 



Pauli operators. We first consider the scheme detailed in |104, and then derive another 
scheme with a slightly different interpretation. 



* If a procedure of applying an operation to every qubit in a code block implements the encoded version 
of the operation, it is bitwise. We also call an operation bitwise if it is a tensor product of operations each 
acting on a single qubit of a code block. 
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Consider n qubits, and a subset, /C, of k qubits. Suppose we want to measure the parity 
of the qubits in /C. This corresponds to measuring the operator Zj^ = Jlie/c ^^^^ ■ ^ic can 
be measured by performing a cnot from each qubit in /C to a fixed anciha |0), which is 
flipped iff the parity is odd (see Fig. ^I5|). 



|0) 



— 1 


1 






F 




M 















Figure 3.5: Circuit measuring the parity of a subset /C of qubits. Only qubits in IC are 
shown. 



However, this method is not fault-tolerant, as a phase error in the ancilla can propagate to 



all the qubits it has interacted with. The solution in [104] is to prepare an ancilla in the 
"cat-state" \ck) = \0)^^ + and apply bitwise H to obtain an equal superposition of all 

even parity states. Now, one can apply a cnot from each qubit in /C to the corresponding 
qubit in the ancilla, and finally apply direct measurements to the ancilla to determine the 
parity. Note that no extra information on the encoded state, besides the parity of IC, can 
be obtained, and an error in the ancilla can propagate to at most one qubit in the code 
block. 



|0)®fc + |i 



H 



I— H 



Figure 3.6: Fault-tolerant circuit measuring the parity of IC. 



In 1 48], this method was extended to any bitwise operator M = Mi 



Mn with 
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eigenvalues ±1 as follows. Each Mi ^ I can be diagonalized to Z by some Ui. Hence, 
M = UZ]qU^ where U = Ui ® ■ ■ ■ ® Un '^s bitwise and JC = {i|Mj / I}. One can measure 
M by applying U'^ to the state, measuring Zjq and applying U . 
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Figure 3.7: Fault-tolerant circuit measuring Mi 
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Using X = HZH, the above figure can be transformed to 
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Figure 3.8: Fault-tolerant circuit measuring Mi 



•Mr, 
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The procedure is to apply a "cat-controlled-M" bitwise from the cat-state to the code block, 
which results in a phase flip to the cat-state iff the measurement outcome is —1. This 
phase flip can be measured in the conjugate basis defined by the Hadamard transform. It is 
necessary to prepare and verify the cat-state to high fidelity, and to repeat the measurement 
to avoid measurement errors, but these steps can be done. 

Measurement induced logical operations 



It is possible to use measurements to effect an evolution or to prepare certain states ||5^, |57|. 
We first consider what happens to the stabilizer S and the encoded Pauli operators when 
a measurement of K £ Q is made. If K £ S, the measurement is trivial. If K £ N{S)/S, 
measuring K corresponds to a measurement of the encoded logical state. Therefore, we 
assume K ^ N(S). In this case, K anticommutes with some element Mi in S, and we 
may choose the first generator of S to be Mi, and all other generators to commute with 
K. Measuring K projects the state to some ±1 eigenstate of K, leaving Mi unchanged 
in S for i > 2. We can "put" K into the stabilizer in the following way. Nothing needs 
to be done if the post-measurement state IV') is a +1 eigenstate of K. If is a — 1 
eigenstate of K, then A\tp) is stabilized by K for any A that anticommutes with K because 
A\ip) = A{-K)\^p) = K{A\ip)). If furthermore, A commutes with Mi for i > 2, A\^p) 
has stabilizer generated by A and {Mj}j>2. (Such A always exists, since A = Mi is one 
possibility.) Therefore, by measuring K and "fixing-up" if necessary, one can replace Mi 
by K in S. The transformation in the (encoded) state is given by the changes in N{S) /N. 
Note that if is a generator for N{S)/S, MN is still a generator VM G S. Therefore, the 
effect of measuring K is to replace N by A^Afi if (and only if) N anticommutes with K. 

In summary, when we measure an operator K, we perform the following procedure on 
the stabilizer and the encoded operations X and Z: 

1. Identify an element Mi £ S that anticommutes with K. 

2. Rewrite X, Z and the remaining generators of S to commute with K by multiplying 
by Ml if necessary. These rewritten operators are equivalent to the old ones. 

3. Replace Mi by K to obtain the new S and N{S)/S. 
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Gn-NiS) =^ 

Detectable errors 



N{S') 



N{S)/S = 
Logic gates 



S = Idenity 



N{S')/S' 



S' 







Measure K 



Figure 3.9: Using measurement as logic operation. 

This technique to transform the stabilizer is very useful for preparing known states, as will 
be used in Chapter ^. 

Universality 

As mentioned before, the Clifford group is not universal. However, the Clifford group 
together with an extra gate such as the Toffoli gate or the vr/S gate is universal. The 
construction of these gates is made possible by special ancilla preparation and measurements 
which can be made fault-tolerant. A detailed description of this is the subject of Chapter]^. 

3.4.2 The Threshold Theorem and Overhead 

We have described how quantum error correcting codes can reduce the error probability if 
the logic operations are perfect and how fault tolerant methods can be applied to avoid the 
propagation of errors. However, the above results do not guarantee a net reduction of errors. 
The coding procedure increases the complexity of the circuit and therefore can increase the 
net error probability, especially if the elementary error rate is too high. Moreover, such a 
procedure may require a lot of space and time resources. The threshold theorem for quantum 
computation provides a reassuring answer to the above concerns. It can be stated as follows: 



Provided the noise in individual quantum gates (or storage) is below a certain 
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constant threshold, it is possible to efficiently perform an arbitrarily long quan- 
tum computation reliably. 

The threshold theorem is made possible by concatenating well chosen quantum error cor- 
recting codes to obtain super-exponential reduction in net error probability while increasing 
the space-time requirement only exponentially. To be specific, suppose a computation of T 
steps is to be performed. The error per step should be about Paim = ^/T. Suppose the ele- 
mentary error probability is p, and a t-error correcting code of block size n is used, reducing 
the error from p to cp*+^ for some constant c. If we further use n of these first-level-encoded 
qubits to encode one second-level-encoded qubit, and if the structure of the circuit is pre- 
served (self- similar) the error will be reduced to c{cp^^^Y~^^ . If L levels of concatenation 
are used, the error will be reduced to c(i+(*+i)+-+(*+i)^"')p(*+i)^ = c-i/*(ci/*p)^*+^^^ • It 
is clear that concatenation is useful iff p < pth = c~^/*. To achieve the desired level of 
precision, we set paim ~ c"^/*(ci/*p)^*^^'''^ , in which case {t + 1)^+^ = i°g(P°yMfe) . The 



space requirement increases by a factor of = '°fog(p/^(^)''^ ' i &nd similarly for the 

time requirement. The requirements are only poly log in the desired accuracy or the length 
of the computation, which is efficient enough in most applications. 



(logt+i n) 



3.5 Summary and preview 

In this chapter, we have summarized the major result in the development of quantum error 
correction. We have reviewed classical coding theory. We have discussed the Shor code, 
the criteria for quantum error correction, the stabilizer formalism and the stabilizer codes, 
fault-tolerant quantum computation and the threshold theorem. 

This Dissertation contains various results related to quantum error correction, and they 
are organized as follows. 

• Chapter |^ consists of a series of related results on the direct construction of quantum 
codes without classical analogs. This approach is complementary to the one described 
in this chapter and is useful for exploiting knowledge of the noise process to improve 
the efficiency of the codes. Codes with rates unmatched by general codes will be 
presented. Surprisingly, these codes violate the quantum error correction criteria, 
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and the understanding of such violation leads to a relaxed criteria for quantum error 
correction. 

Chapter ^ describes a systematic method to construct gates outside the Clifford group 
fault-tolerantly. These gates are needed in addition to the Clifford group operations 
to form a universal set of gates. The existing constructions are difficult to understand 
and generalize. A systematic construction which explains and generalizes most known 
constructions will be given. 

Chapter ^ describes an experimental test of quantum error correction in NMR. It 
details the modifications required in NMR to perform the coding scheme, and the 
analysis that conclusively demonstrates the net reduction of error due to coding. A 
systematic study of deviations from the ideal behavior is presented. 



Chapter 4 

Amplitude damping codes &; 
approximate QEC 

In this chapter, we wih describe direct construction of quantum codes without classical 
analogs. We describe a class of bosonic codes and a 4-qubit binary code for amplitude 
damping, which exploit knowledge of the noise process to outperform the rate of the general 
codes. We also obtain a relaxed criteria for quantum error correction. These original results 
are reported in |^, [7^ . 

4.1 Bosonic codes for amplitude damping 

We have seen in Chapter ^ how quantum error correction is possible theoretically, and how 
it can be useful for reliable computation even when the coding operations are imperfect. 
Most known codes assume the Pauli error basis (/, X, Y, Z), and coding is performed to 
allow correction of arbitrary unknown errors on a number of qubits. The smallest general 
1-error correcting code requires 5 qubits to encode one qubit. However, in a given physical 
system, the dominant decoherence process is of a specific nature which may admit a simpler 
description. An important question therefore arises: given a particular decoherence process, 
what is the optimal quantum error correction scheme? 

While a general solution to the above question remains to be found, we describe progress 
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towards such a solution. In Sections 4.1.1- |4L^ , we demonstrate a new class of quantum 



error correcting codes which correct only one particular noise process known as amplitude 
damping. In contrast to other previous work, we consider bosonic systems which occupy 
the Hilbert space |0) • • • |A^). Moreover, our codes are constructed directly from the quan- 
tum error correction criteria without classical origins. Some of these bosonic codes achieve 
better rates than any general code. In particular, we found a code with corrects for one 
amplitude damping error using effectively n = 4.6 qubits to encode one qubit. We present 
necessary and sufficient conditions for the codes, and describe construction algorithms, and 
performance bounds. 

4.1.1 Amplitude damping model 

Amplitude damping |82, first introduced in Section 2.4 for the qubit case, describes 



the energy loss from the system to a zero temperature environment. This is a good ap- 
proximation to many real life systems. In this section, we extend the discussion to higher 
dimensions. Amplitude damping can be studied by modeling the system as a simple har- 
monic oscillator. The energy exchange between the system and the environment is given 
by the interaction Hamiltonian: 

Hi = x(a^b + b'^a) (4.1) 

where a, b are the annihilation operators of the system and the environment respectively 
and X is a coupling constant. As mentioned in Section p.4| , complicated interactions can 
often be described by much simpler models. In this case, a single harmonic oscillator for 
the environment is sufficient to model the dynamics of interest. 

We denote the amplitude damping process between the times t and t + At by £. We can 
derive an operator sum representation of £ by assuming that the environment is initially in 
the ground state |0) and by taking the partial trace along the number eigenstates |A;): 

£{p) = TVe [e~^^^^*(p® |0)(0|)e^^^^*] (4.2) 
= T.^kp4 (4.3) 

k 

where = fe(A:|e-^>^^*('^'^+^'<^)|0)b , (4.4) 
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and b denotes the environment model state. describes the event of losing k quanta from 
the system to the environment. Operator algebra techniques [^] can be used to explicitly 
evaluate the inner product, giving 



A 



k 



E ^ [Ij ^{l-ir-h^ \n-k){n\. (4.5) 



In Eq. ( |4.5| ), 7 = 1 — cos^(xAt) is the probability of losing a single quantum from the system 
during time At. |^ Note that Aq ^ I - even when no quantum is lost to the environment, 
the state of the system is still changed. 
For a pure initial state IV'), 

N 

m)m = Y.^k\^)^^\^l (4-6) 

fc=0 

is a mixture of unnormalized pure states. In this case, we may use a shorthand 

N 

[^') = @A,\^), (4.7) 



fc=0 



where the symbols and "©" are reserved for statistical mixtures of pure states. Identical 
states in the mixed sum can be combined using the rule 



a\<t^)®h\4>) = ^\a\'^ + W\cl,). (4.8) 

The normalization of each pure state component gives its probability of occurrence. 

So far, we have described the effect of amplitude damping on a single register. Consider 
now a system with m registers undergoing independent amplitude damping. An initial pure 
state 

IV'm) = hi • ■■nm) (4.9) 



^ T his t ime behavior in our simple model differs from a more detailed model, such as the one in Ap- 



pendix A.l. Such distinction is irrelevant for the present purpose. 
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becomes the mixed state 



[(pout) 



N 

0^fc|no> 

k=0 



N 
lk=0 



(4.10) 



with [N + 1)™" possible final states. It is convenient to use the shorthand 



(4.11) 



where kj is the j-th digit of the number k written in base N+1, so that we may rewrite 
Eq. (^ ) as 

[i>out)= AfclV'm)- (4.12) 
fc=0 



As an example, consider the amplitude damping of the state 



IV-m) = a|01) + 6|10). 



Using 



Ao = |0)(0| + yr^|i)(i| 

Ai = V7|0)(l|, 



(4.13) 

(4.14) 
(4.15) 



the output state can be found to be 



(4.16) 
(4.17) 



This result can be understood intuitively: The original state only contains a single quantum, 
thus, whenever it is lost, the final state must be the vacuum. This example indicates that 
the state of Eq. ( [4.13| ) is useful for detection of a single quantum loss. However, it cannot 
recover the input state upon detection of an error, so cannot be used for error correction. 
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4.1.2 Example 

Let us motivate our code construction by considering the following example: We encode the 
logical zero and one states of a single qubit as 



|0l) = 



|40) + |04) 



V2 



|U) = |22), 



such that the initial state is the arbitrary logical qubit 



(4.18) 



\An) = a\OL)+b\lL). 



(4.19) 



The possible outcomes after amplitude damping may be written as 

[^out)=^\h)=^A~k\^in), 
k k 



(4.20) 



where we express k as a base 5 numeral, and is an unnormalized pure state (the norm 
of which gives its probability to occur in the mixture). For small loss probability 7, the 
most likely final state will be 

|</'oo) = (l-7)'lV'm), (4.21) 

corresponding to no quanta being lost to the bath. The next most likely states result from 
the loss of a single quantum: 



1001) = V2^(l - 7)^/' H03) + 6|21) 
l-^io) = v^(l-7)'/' [a|30) + 6|12)" . 



(4.22) 
(4.23) 



States resulting from the loss of more than one quantum occur with probabilities of order 
7^. Therefore, we aim at correcting up to losing one quantum. Each such error Ei takes 

\0l) and |1l) to states \Oi)i and |li)j respectively. The key is that lOx,), IOl)^ and \lL)i 
\/i are mutually orthogonal, and so are {(poo)^ \4'0i)y ^-nd |<^io)- In principle, a ("quantum 
non-demolition") measurement scheme can detect all error syndromes. Furthermore, for 
each i, the norms of \OL)i and \lL)i are equal. After detecting an error syndrome i, one can 
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apply an appropriate unitary transformation converting \OL)i and \lL)i, to and 
respectively. This makes possible the correction: 

a\OL)^ + b\lL)i ^ a [o|0l) + , (4.24) 

where a is independent of a, b. Note, this is done without any information about a, b, 
and without diminishing the amplitude of the erroneous state. For this particular code, the 



output state has fidelity |102| , |72|| (see also Eq. ( 4.60 )) T = 1 — 67^ with respect to the 
input. 

As a comparison, if |0l) = |11), = |22), the most probable state is |i;^oo) = o(l — 
7)|11) + 6(1 — 7)^122). No unitary transformation will bring it back to + 6|22) unless 

a, b is known. Alternatively, a fixed non-unitary transformation can revert the change, but 
it will reduce the fidelity of the correction process to 1 — 0(7). 

In the next section, we turn to the criteria for a scheme in which k qubits may be 
encoded so that losses up to t quanta may be corrected. 

4.1.3 Code Criteria 

We consider a non-degenerate code which will correct up to t losses. The code encodes 
+ 1 logical states in m bosonic registers each having a maximum of N quanta and -|- 1 
dimensions. We define /C(s) to be the set of all m-digit base-(A^ -|- 1) numbers whose digits 
sum to s (corresponding to the errors having exactly s losses). The logical states must 
satisfy the criteria for quantum error correction, Eq. ( |3.9D , 

{ci,\AlAy\ci,) = for h^h or k^k' (4.25) 
(q|4a^|q) = <7fc W (4.26) 

for all k,k' G U^<j/C(s). In Eq. ([4.26| ), is some constant which depends only on k. 
Equation ( ^lEj ) requires that all erroneous states be orthogonal, and Eq. (|4.26D requires 



that the encoded Hilbert space not be deformed. 

We now present an explicit statement of these two conditions as algebraic conditions on 
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the code construction. We first consider a logical state |q) which is an equal superposition 
of Ni energy eigenstates, \ni . . .rim)- We shall borrow from |5^] the name quasi- classical 
states, "QCS" for short, because these states resemble the classical codewords. When all 
the QCS are equally weighted, we call the code "balanced" . Otherwise, the code is referred 
to as "unbalanced" . Each logical state can be represented by a matrix with m columns and 
Ni rows, each row being one of the QCS in the codeword. For instance, if 



1 



nil - ■■ nim) H h \nNii ■ ■ ■ riNim) 



then the corresponding matrix Mi is: 



(4.27) 



nil ni2 
nN,i nN,2 



n2m 



TIN, 



im 



(4.28) 



Non-deformation condition 



For t = errors, we have /C(0) = {0}, and Ao|nji . . . riim) = (1 — 7)^'^'/^|nii . . . Uim) where 
the row sum RSi = Yl^=i'^ij- Criteria given by Eq. ( [4.26| ) require the norm squares of 
^oIq) be the same for all |q), that is: 



1 



Ni ^' 
' 1=1 



(4.29) 



be the same for all |q). A sufficient condition for Eq. ( 4.29 ) is that RSi be independent of i 
and /. In other words, all QCS in all logical states have a total of Nt quanta. Physically, 
this requirement stems from the fact that a state with higher number of quanta decays 
faster. To preserve the a posteriori probability of each logical state |q), we must encode 
them in a subspace in which the decay probabilities are equal for all of them. Denote the 
set of all QCS with Nt quanta in m registers as Q{Nt, m). The non-deformation constraint 



This is not a necessary condition. The Shor code of Eq. (p.q) satisfies Eq. (4.29) without equal row 
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Eq. (|]2|) is satisfied for t = if we construct all the |c/) from states in Q{N'r, 



mj. 



Similarly, for i = 1 error, we have JC{1) = {0 • • • 01, • • • 10, . . . , 1 • • • 00}, and, for example. 



1 



Ao...i|q) = Ao...i--^^\nii . . .riim) (4.30) 



i=l 



2^\l \nii...nim-l) , (4.31) 



1=1 



where the criteria of equal row sums for t = is assumed. Taking the norm square of 
Eq. i^M ), we obtain 

(q|4...i^o...i|q) = ^ E^i-- (4-32) 

' i=l 

The non-deformation criteria require the above sum to be the same for all |q). Equivalently, 
the column sum of the m-th column of each Mi divided by Ni has to be independent of I. 
Similar expressions for all the Aj^ give rise to a full set of criteria: 

Lemma 2: Let each logical state |q) be expressed as an m column, Ni row 
matrix with elements riij. If |c/) satisfy the equal row sum criteria for i = 
losses, and Y.i'nij/Ni = yj V then (qI^^^Iq) = g-^, Mk G /C(l). 

These criteria correspond to certain symmetry requirements in the |q). A similar result can 
be derived for t = 2: 

Lemma 3: Same setting as in Lemma Let us choose |q) which satisfy the 
non-deformation criteria for t = 1, and such that ^iriij^riij^/Ni = yj^j2 fo'^ ^ 
\ci), where (ji, js) G [l,Ni] x [l,iV;]. Then (qI^^^Iq) = g-^, Vfc G /C(2). 

Proof: We need to work out for each k and apply the criteria for 

i = 0, 1, 2. For t = 2, /C(2) = {0 • • • 02, • • • 20, . . . , 2 • • • 00, • • • 11, • • • 101, 
. . . , 11 • • • 0}. For instance, 

^o-02|q) = Ao...Q2—r=^\nii...nim) (4.33) 



E \l ' AT, l^^i' • • • ' - 2)) ' (4-34) 
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where {') is the usual binomial coefficient. The norm square of this state is 



1=1 



(q|^o...02^o-02|q> = ^ 2^nim{nim-l) ■ (4.35) 



The J:T= 1 n-im term is independent of / by the criteria for t — 1; hence, 

m 

^E^L (4.36) 

has to be independent of / to satisfy the non-deformation criteria. Other Aj^ 
with k = - ■ ■ 02, . . . , 2 • • • 00 impose the above requirement on other columns. 

Similarly, /c = • • • 11 changes |q) to: 



2_^\l \nii . . . (njm_i-l)(nim-l)) . (4.37) 



i=l 



which has norm square 



Ni ^ 



nim-iriim ■ (4.38) 



Eq. ( 4.26 ) requires the following to be independent of I: 



1 

' 1=1 



— ^riim-inim . (4.39) 



Similar results can be obtained for other k with I's at any two registers ji and 
j2- When we allow ji = j2, we include the previous result for two losses at the 
same register. This completes the proof of the lemma. 

Generalization to arbitrary t is as follows: 

Theorem 6: Let each |q) be expressed as an m column, Ni row matrix with 
elements riij. If we choose |q) such that: 

E ^ih^ih ■ ■ ■ nijjNi = yj,,j2,-,js (4.40) 
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are independent of |q) V/, y{ji,j2, ■ ■ ■ ,js) G [1,-^;]* and Vs G then 

Proof: For an arbitrary s, the non-deformation constraints for s losses are 
equations involving up to s powers of riij. Using the constraints for fewer than 
s losses, we reduce the new constraint to involving exactly s powers of riij. By 
mathematical induction, the result for an arbitrary s < t can be obtained. This 
completes the proof of the theorem. 

The above theorem can be generalized to unbalanced codes in which logical states are un- 
equally weighted superpositions of QCS. If the amplitudes of the QCS in |q) are {^/JIi, 
• • • , ^fi ) , we replace the sum J2i '^ijif^ij2 ' ' ' ''^ijt/^i by J2i ^J'if^ijif^ij2 ' ' ' ^ijt^ i-^-) '^^ replace 
the equal weights by the ^Uj's. 

As t increases, the non-deformation criteria become very restrictive. We have found 
unbalanced codes by numerical search correcting up to t < 4 (Section [4.1.7] ) which have no 
analogs in the balanced codes. On the other hand, for t < 2, we found simple construction 
algorithms for balanced codes with no apparent counterparts for the unbalanced codes. 

Orthogonality condition 

The other criteria, the orthogonality constraints given by Eq. ( [4.25| ) can be satisfied as 
follows. Let |n) = \ui . . . Um) and \v) = \vi . . . Vm) be two states in Q{Nt, m). We define a 
distance between u and v as 

i 

Clearly, < P < iVr- Moreover, V{u,v) = V{v,u), V{u,u) = 0, and 

V{u,v) +V{v,w) = + (^-^2) 

i 
i 

= V{u,w). (4.44) 
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Thus P is a metric on the discrete space Q{NT,m). (For binary states, P is half of the 
Hamming distance.) Define the distance between any two logical states |Qj) and \ci,^) to 
be the minimum of T>{ui,U2) over all QCS l^i), \u2) in |q^) and Iqj) respectively. Two 
logical states with non-negative amplitudes of the constituent QCS are orthogonal iff theiv 
distance is non-zero. We therefore have the following: 

Theorem 7: Let |qj) and Iqj) be formed from the states in Qi and Q2 respec- 
tively, where Qi, Q2 C Q{NT,m) and V{ui,U2) > t Vtii G Qi,U2 G Q2- Then 
{ci,\AlAyK)=0, y~k,~k' e[js<tlC{^)- 

Proof: Let ^^,|Qj) = Aj^lci^) = \di^), and let \vi), \v2) be QCS in \dij^), 

1^(2) respectively s.t. V{di-^^, di^) = 'D{vi,V2)- Let \ui), \u2) be the QCS in |Qj), 
Iqj) that are mapped to \v2) by Aj^,, Aj^. Then, 'D{ui,vi) = 'D{u2, V2) < t/2, 
and V{vi,V2) + V{ui,vi) + 'D{u2, V2) > 'C'iui, U2) > t. Hence, 'D{vi, V2) > and 
V[di^.,di^) > 0. Therefore, \di-^) and {di^) are orthogonal states. □ 

In other words, by constructing logical states using QCS which are sufficiently far apart, 
the orthogonality conditions of the erroneous states can be maintained. 

Example revisited 

In view of the above non-deformation and orthogonality conditions, it is obvious why the 



example in Section |4.1.2| can correct for one loss. The logical states are represented by 

(4.45) 



4 
4 



Ml 



2 2 



with all QCS taken from Q(4, 2) and T>{do,di) = 2. We now generalize this example to a 
systematic construction. 

4.1.4 Construction Algorithm For t < 2 Balanced Codes 

In this section, an explicit procedure to obtain a class of balanced codes to correct for t = 1 
and t = 2 errors is presented. 



74 



CHAPTER 4. AMPLITUDE DAMPING CODES <fe APPROXIMATE QEC 



To correct for t = 1 error, consider ordered m-tuples {xi,X2, ■ ■ ■ , Xm) such that X1+X2 + 
. . . + Xm = n. We will use the same symbol Q(n,m) for the space of all such m-tuples as 
well as the space of all QCS {|a;iX2 • • • x^)}- Let TZ be the cyclic permutation on Q(n, m), 
i.e. 7?.((xi, X2, • • • , Xm)) = {x2, . . . , Xm, xi). Define the order of an m-tuple to be the size of 
its orbit under TZ. Since the order p must divide m, let m = pq. An element of order p looks 
like {xi, . . . , Xp, xi, . . . , Xp, . . . , xi, . . . , Xp) with the string (xi, . . . , Xp) repeated q times. The 
orbit looks like: 

(xi, X2, ■ ■ ■ 1 Xp, , Xi, X2i ■ ■ ■ 1 Xp) 

{X2,X3, ...,Xi, ,X2,X3,...,Xi) 

(xp, Xi, . . . , Xp—i, , Xp, xi, . . . , Xp_i) . (4.46) 

Each logical state is taken to be an equal superposition of QCS in some orbit: 

|c) = -^i \xiX2---Xp XiX2---Xp) 

+ |X2X3---X1 X2X3---Xi) 

+ ••• 

+ |xpXi---Xp_i XpXi • • -Xp-i)) . (4.47) 

States formed by distinct orbits are orthogonal, as the orbits partition Q{n,m). Further- 
more, we multiply each number in the QCS by d. The minimal distance between distinct 

QCS is at least d, since distances come as multiples of d only. Hence, all the erroneous 
states will remain orthogonal after t losses il t < d. Logical states are now in the form: 

dxp ■ ■ ■ dxidx2 ■ ■ ■ dxp) 
dxi ■ ■ ■ dx2dx3 ■ ■ ■ dxi) 

dxp-i ■ ■ ■ dxpdxi ■ ■ ■ dxp_i)) . (4.48) 



c) = ( \dx1dx2 ■ ■ ■ 

+ \dx2dx2, ■ ■ ■ 

+ ••• 

+ \dxpdxi ■ ■ ■ 
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For the non-deformation criteria, the row sum is nd = Nt by construction. The column 
sum divided by A''; (= p) is: 

dxi-\ h dxp dn 



P 



m 



in any logical state, independent of the order of the constituent QCS. Codes in examples 



(l)-(3) in Section 4.1.7 are constructed in this way. 

To correct for t = 2 errors, the t = 1 criteria have to be satisfied as well. We take a 
subset of the t = 1 logical states which satisfy the extra non-deformation criteria for t = 2. 
We also replace d = 2 hy d = 3. For m > 2, pairs of logical states in the form 



|0l> 

\w 



1 



\dxidx2 • • • dxm) + \dx2dx3 ■ ■ ■ dxi) + • • • + \dxmdxi ■ ■ ■ dxm-i) (4.50) 



\dxm ■ ■ ■ dx2dxi) + \dxm-i ■ ■ ■ dxidxm) + • • • + \dxidxr. 



■ dx-i 



(4.51) 



always satisfy the non-deformation criteria for t = 2 (proof omitted). Example (4) in 
Section |4.1.7] is constructed in this way. These codes encode one qubit. 

For t > 3, we perform a numerical search for special QCS in which the system of linear 
equations for the weights is linearly dependent. In the best case, the number of linear 
equations to be solved can be much reduced. Therefore we can find codewords involving 
fewer QCS, fewer number of registers and fewer number of quanta. Although encoding is 
certainly possible with a much smaller Hilbert space, we have not found a systematic way 



to generate such QCS. Codes correcting i < 4 errors are exhibited in Section 4.1.7 



4.1.5 Existence of codes 

How large must Nt, m, and Ni be to satisfy both the non-deformation constraint, 
Eq. ( [4.26| ), and the orthogonality constraint, Eq. (4.25)? We now show that an unbalanced 
code exists for arbitrarily large t if Nt is allowed to be arbitrarily large, and give an upper 
bound for the required Nt- 

Let |co), |ci), . . ., |q), . . ., |q^), be /o + l logical states, each being an unequally-weighted 
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superposition of A'^; QCS in Q{n,m). For convenience, define 

^/ N / n + m — 1 \ (n + m — 1)\ / , 

r{n,m)=[ UT ^-^2 

\ m — 1 I n\[m — 1)\ 

as the number of all possible partitions of n into m non-negative parts, i.e., the number of 
ways to write xi + X2 + ■ ■ ■ + Xm = n |63]. Suppose we choose Nt = nd such that 



V (^'"i) = 'P{n,m) >No + Ni + N2 + --- + Ni^ = Nqcs , (4.53) 

where Nqcs is the total number of QCS in all the logical states. By Theorem 2, all the QCS 
involved can be chosen to be distinct, and multiplication of the number states hy d = t + 1 
allows the orthogonality condition to be satisfied. 

On the other hand, the non-deformation condition involves satisfying a certain number 
of constraint equations, given by the total number of possible errors times Iq. The possible 
number of ways to lose s quanta from m registers is just the number of partitions of s into 
m parts, V{s,'m). Take the QCS to be arbitrary, and solve the non-deformation constraint 
equations (of Theorem 1, generalized to include unbalanced codes) as linear equations for 
the weights of the QCS. As long as the number of variables {Nqcs) are no fewer than the 
number of equations, solutions always exist. We may also augment the system of equations 
hy lo + l equations to ensure the correct normalization of each logical state. Hence, for N^ 
satisfying 

l + lo + loJ2'P(^^'^)^^QCS<'p(^,m) , (4.54) 
codes with m registers correcting t errors exist. 



We simplify Eq. ( 4.54 ) by writing explicitly the expression for V{NT/{t + l),m) and 
P(s, m) and performing the summation (by writing the summand as a telescopic sum). We 
obtain 
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For example, when m = 2, Eq. ( [4.55| ) becomes 

lo(t + 2)(t + l) , Nt , , 

— ^ + /o + 1 < ^ + 1 , (4.56) 

which gives a scaling law Nt ~ t^lo/'i- The scaling of Nt as a function of t for arbitrary but 
fixed m can be obtained by approximating the factorials involving Nt and t in Eq. (4.55) 



/ , \ 2m-l 

using the Stirling approximation. We found that Nt ~ ( ^ ) ™ ^ • We have also 
assumed to be large in obtaining the scaling law, and this is a consistent assumption. 
Note that this upper bound is generally much larger than necessary, as can be seen in the 
examples for i = 3 and t = 4. Much more efficient codes may be obtained, because the QCS 
may be chosen to give redundant constraint equations. This may be accomplished either 
systematically (Section |4.1.4 ), or by numerical search (Section [4.1.7| ). 



4.1.6 Rates and Fidelities 

The performance of these bosonic quantum codes can be characterized by the rate - number 
of qubits communicated per qubit transmitted, and by the fidelity - the worst-case over- 
lap between the input and the decoded and corrected output. We now discuss these two 
measures. 

The rate r is given by the ratio of the number of encoded qubits to the equivalent 
number of qubits in our ambient Hilbert space: 

^ (4.57) 



mlog2(iV + 1) ' 

where 2^ = number of logical states, and {N + l)"* is the size of the Hilbert space in our 
code. The exact number of possible logical states depends on the choice of N (maximum 
number of excitations in any single register) and m (the number of registers). For t = 1, 
we have worked out a counting scheme, but omit the details here. However, the majority 
of the QCS have order m. Hence, to a good approximation, the number of logical states 
obtained is: 

2^ = . (4.58) 

m 
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Thus, the asymptotic rate of our codes for large n is For small n, logical states involv- 



ing fewer than m QCS allow slightly more qubits to be encoded compared with Eq. ( 4.58 ) 



(see examples (l)-(3) in Section 4.1.7 ). This small gain can be important in certain appli- 



cations. For arbitrary but fixed t, recall from Section 4.1.5 that Nt ~ " ^ 



t '"-I is 



large enough to guarantee the existence of a code with /q + 1 logical states. Together with 
the fact N < Nt, a loose asymptotic lower bound for the achievable rate r = ^^^^ can be 
obtained for our code. 

We now turn to the code fidelity which we desire to know as a function of the 



parameters Nt, m and t. Recall from Section 3.3.2 that the minimum overlap fidelity is 
defined as: 

T = mm^^^{'^pin\pf\^pin) , (4.59) 

where p/ is the final output state after correction. When the correction criteria are sat- 
isfied, recovery procedures exist for the correctable errors which recover the input states. 
Therefore, the fidelity is at least the total detection probability of the correctable errors: 

•^ = mm^„. E {i'^n\AlA-,\i^ir,) . (4.60) 
fceU,<,'C(s) 

Let the input state be \ipin) = Z^iO^dcz)- Using the orthogonality and non-deformation 
conditions, 

{ij,n\AlA~^\i^,r.) = \{ci\AlAj^\ci)\, (4.61) 
for any Expressing |q) as 

Iq) = VJ^lniinu ■ ■ ■ riim) (4.62) 

+ VJj^\n2in22 ■ ■ ■ n2m) (4.63) 

+ ••• (4.64) 

+ \^Jj^\nNiinNi2 ■ ■ ■ riNim) , (4.65) 



4.1. BOSONIC CODES FOR AMPLITUDE DAMPING 



79 



it follows that for k = {ki, k2, ■ ■ ■ , km) £ JC{s): 



{ci\AlA-,\c,)\ = (1 -7)^^-VE/^4 ) . (4-66) 



=1 \ J / 



and using the following relation for binomial coefficients: 



_ / rail \ / 71^2 \ n 



we find that the fidelity is 



(4.68) 



= l-(^^^Jy+^ + 0(7*+^). (4.69) 

This expression holds for balanced codes as well as unbalanced codes. The amazing feature 
is that given a code which satisfies the orthogonality and non-deformation constraints, J-' 
is independent of m; it is determined only by Nt and t. 

One should note that although codes can be constructed to correct for an arbitrary 
number t of losses, Nt increases with t, which in turns implies a higher loss probability for 
the system as a whole. These two effects compete against each other to give an upper bound 
on the fidelity, which can be estimated as follows. Let Nj- be the required total number of 
quanta, and t be the total number of losses to be corrected. As previously discussed, the 
quantum error correction criteria reduce the two parameters to one degree of freedom, in 
terms of which we may estimate the optimal achievable fidelity. In terms of t, the optimum 
fidelity for fixed 7 is obtained by setting 

^ln(l-.F)=0. (4.70) 



From Eq. (|^, this gives to first order in 7 



80 



CHAPTER 4. AMPLITUDE DAMPING CODES & APPROXIMATE QEC 



Using the Stirling approximation for the factorials in we obtain 

\NT-t-lJ dt \ t + l J ' ^ ^ 



In general, Nt is much larger than t, which allows further simplification of Eq. ( 4.72 ): 



"^^^ * Inf-^) +ln7 = 0. (4.73) 



dt Nt \Nt 

The exact dependence of Nt on t is generally very complicated. This point can be appreci- 
ated from the explicit code examples following this section. In particular, the minimum Nt 
depends on the existence of "good solutions" to the criteria, therefore, the optimal fidelity 
is analytically intractable. We only have a bound on the fidelity of a bosonic code with 
arbitrary QCS. There is no theoretical bound on the number of correctable errors. 

For concreteness, we will illustrate the above assuming that Nt asymptotically follows 
a power scaling law in t. As illustrated in the previous section, Nt is bounded by such 
polynomials in t. Therefore the following gives a loose lower bound of the upper bound of 
the fidelity. Suppose Nt ~ /^o^" where the prefactor / and exponent a are approximately 
constant. Eq. ( 4.73| ) can be solved for the optimum t: 



topt^ie-yjflof^'^-'^ . (4.74) 

Plugging back into the Eq. ( [4. 691) would give an estimate for the optimal achievable fidelity. 
However, these gross estimates are not expected to be meaningful in actual applications, 
because NT{t) will be the determining factor, and as previously mentioned, is analytically 
unobtainable. 



4.1.7 Explicit Codes 

Some explicit codes resulting from our work are presented here. Normalization factors are 
omitted whenever they are common to all logical states. Codes are specified as {{Nt, m, lo + 
1, d)), where Nt is the total number of excitations in the QCS, m is the number of registers 
for each QCS, Iq + 1 is the number of logical states and d is the minimal distance between 
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the logical states. The fidelity of all the codes are given by !=ti I 
Example (1) - ((4, 2, 2, 2)), n = 2, i = 1, fidelity Ri 1 - 67^: 



|0l> 



1 

72 

|22) 



[|40) + |04)] 



(4.75) 
(4.76) 



Example (2) - ((12, 3, 10, 2)), n = 6, i = 1, fidelity JT « 1-667^ labels given in hexadecimal 
(c = 12, a = 10): 



I Co) = 

|ci) = 

\C2) = 

|C3) = 

|C4) = 

|C5) = 

ke) = 

\C7) = 

ks) = 

leg) = 



1 

1 

71 
1 

71 
1 

73 

1 

71 
1 

71 
1 

71 
1 

71 
1 

71 

1 444) 



00c) + 
02a) + 
048) + 
066) + 
084) + 
0a2) + 
228) + 
246) + 



264) + 



|c00) + |OcO)] 
|a02) + |2a0)] 
1 804) + 1 480)] 
1 606) + 1 660)] 
|408) + |840)] 
|20a) + |a20)] 
|822) + |282)] 
1 624) + 1 462)] 
1 642) + 1 264)] 



Example (3) - ((6, 3, 4, 2)), n = 3, t = 1, fidelity 1 - 

|co) = |600) + |060) + |006) 

|ci) = |420) + |204) + |042) 

|C2) = |240) + |402) + |024) 

|C3) = |222). 



(4.77) 

(4.78) 

(4.79) 

(4.80) 

(4.81) 

(4.82) 

(4.83) 

(4.84) 

(4.85) 
(4.86) 



(4.87) 
(4.88) 
(4.89) 
(4.90) 
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Example (4) - ((9,3,2,3)), n = 3, t = 2, fidelity JT 1 - 8473; Note this code differs 
from the previous one from having d = 3 instead of d = 2. We take only |ci) and |c2) as 
codewords. 

|0l) = |306) + |063) + |630) (4.91) 
= |036) + |360) + |603) (4.92) 

Example (5) - ((6,4,2,2)), n=6=0+l+2+3, fidelity ^ 1 - 157^: The minimal distance 
between QCS is d = 2. However, the QCS are not generated by multiplying each number 
by d = 2. 

|0l) = |0321) + |1032) + |2103) + |3210) (4.93) 
lU) = |0123) + |1230) + |2301) + |3012) . (4.94) 

Example (6) - ((7, 2, 2, 2)), fidelity ^ ~ 1 — 217^: The logical states arc not formed by cyclic 
permutations of the QCS. Note that column one and two have different column sums. 

|0l) = |70) + |16) (4.95) 
= |52) + |34) (4.96) 

Example (7) - ((9, 2, 2, 3)), fidelity J^;sil- 847^: Unbalanced code that will tolerate t = 2 
errors. Note that one codeword is formed from the other by reversing the order of the 
registers. (This symmetry between the two registers is a sufficient condition for balanced 
codes with i = 2, m > 3.) 

|0l) = ^|90) + ^|36) (4.97) 
|1l) = ^|09) + ^|63) (4.98) 

Example (8) - ((9, 3, 2, 3)), fidelity J^;sil- 847^: Unbalanced code that will tolerate t = 2 
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errors, showing that the symmetry is not a necessary condition for correcting t = 2 errors. 

|0l) = ^[|036) + |306) + |360)] (4.99) 
v3 

= ^[V6|333) + V2|009) + |090)] (4.100) 

Example (9) - ((16,2,2,4)), fidehty JT 1 - 18207^: Unbalanced code that will tolerate 
t = 3 errors. Labels are given in base 17. c and g denote 12 and 16 respectively. 

|0l) = -^[\0g) + \gQ) + V6\88)] (4.101) 
= ^[|4c) + |c4)] (4.102) 

Example (10) - ((20,3,2,4)), fidelity ^ 1 - 48457^: Another unbalanced code that will 
tolerate t = 3 errors. Labels are given in base 21. c, g and k denote 12, 16, and 20 
respectively. 

|0l) = l[\0ig) + 2\40g)+2V5\0k0)] (4.103) 
o 

= -^[V2\Uc) + V3\m)] (4.104) 

Example (11) - ((50,2,2,5)), fidelity T ^ 1 - 21187607^: Note the rapid growth in the 
numerical factor in the second term. To correct for large number of errors, we need to 
encode a qubit in a large Hilbert space, but emission probabilities are large for high number 
states. This puts a limit of performance in our codes. The actual code involves numbers 
five times the numbers shown below, a denotes 10. 

|0l) = \/5lOa) + |46) + V^|82) + yj|91) (4.105) 
n~ IT /"33" FT PT 

\W = Vis'^^^ + Ve'^^^ + Vgo'^^^ + Vs'^^^ + Vgol"^^- ^^"^"^^ 
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4.1.8 Discussion 

Our treatment of amplitude damping errors contrasts from the usual standpoint of quantum 
error correction, which deals with X and Z errors. The relationship can be understood 
by expressing the and Ai operators as coherent superpositions of such errors; from 
Eqs. (|4l4|) and (^ ), 



^0 = ^[ii + VT^)i + {i-VT^)z] (4.107) 
Ai = ^ - y] . (4.108) 

With probabilities up to 0(7), a binary code with m qubits will either project ^q^Iq) 
onto a state with no errors, or project Aq^~^ Ai\ci) onto a state with one X 01 Y error. 
Hence, a binary code correcting for any one qubit error will indeed correct all amplitude 
damping errors up to losing one quantum, although not to all orders in 7. One reason we 
have studied bosonic codes is to exploit the possibilities for achieving higher efficiencies or 
easier physical implementation, though the study is theoretically interesting on its own. 

It is important to realize that amplitude damping errors are not independent qubit 
errors, since the decay factor of each QCS depends on the total number of excitations in it. 

Rates from our bosonic codes contrast with those achievable by the usual binary codes. 
For the code in Example (1), N = Nt = nd = 4, m = 2 and A; = 1, so r = 0.22. This is 
slightly better than r = 0.20 for the five qubit binary perfect code described in Eq. ( |3.33| ), 
and much better than the eight qubit code of Plenio et. al. |93[| . 

The code fidelities may also be compared. Our ((4, 2, 2, 2)) code achieves 1 — 67^. 
In comparison, the five-qubit binary code achieves fidelity « 1 — 1.757^, while the eight- 
qubit code achieves only ~ 1 — 67^! This agreement with the bosonic code is not accidental; 
it stems from the use of the same total excitation number. However, it is worthwhile to 
point out that despite the effort to balance the codewords, the five-bit code still has better 
performance on average, due to the smaller number of excitations involved in the system. 

In conclusion, we have given general criteria for quantum error correction of amplitude 
damping in bosonic states and have constructed codes based on the amplitude damping 
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operation elements. This generalizes the binary error correcting codes for Pauli errors, and 
specializes to address the dominant decoherence process of many systems such as photons 
transmitted through optical fibers and trapped ions. We classify our errors according to 
the number of excitations lost, instead of the more common classification of the number 
of qubits or registers corrupted. We have shown that specialization to correct amplitude 
damping can improve the rate in some cases. However, direct construction of codes can 



be difficult. In Sections 4.2-4.3, we will describe an improved method which leads to even 
better codes. 

4.2 Approximate quantum error correction can lead to bet- 
ter codes 

We have seen that codes constructed for specific noise processes can have better efficiency 
than general codes based on classical codes and the Pauli error basis. Unfortunately, without 
the specific structure in the Pauli error basis, there is no general method to construct 
quantum codes. There is no apparent method to adapt classical coding techniques without 
the Pauli-error basis. The approach to solve the code criteria directly can be very difficult, as 



can be seen in the bosonic code construction. Consequently, few such codes are known [33 
§,11, 



As the quantum error correction criteria are both necessary and sufficient, code con- 
struction has always aimed at satisfying the criteria exactly. One important result of this 
Dissertation is the observation that the criteria is not necessary. We developed a new 
approach to quantum error correction based on approximate satisfaction of the original 
quantum error correction criteria. These new criteria are important in two ways: they ad- 
mit more codes and they can be much simpler algebraically. Therefore, they are very useful 
in seeking more efficient codes for specific noise processes and in direct code constructions. |^ 

The usefulness of this approach is immediate. Using the approximate criteria, we have 
discovered a four-qubit binary code which corrects for single qubit amplitude damping 



^ In the general code construction based on the Pauh error basis, the criteria are naturally satisfied 
exactly. 
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errors. Incidentally, this code was not found even after a substantial amount of effort by 
the community, because it violates the exact criteria. Moreover, such a short non- degenerate 
code is impossible using the Pauli basis. The reason can be understood by examining the 
operation elements of amplitude damping: 



To first order in the error probability 7, n + 1 possible errors may happen to an n-qubit 
code using the Aq, Ai error basis, so it follows that n > 3 is required. In contrast, in the 
Pauli basis, any X or 1" error must be corrected by the code, so that 2n + 1 possible errors 
must be dealt with. A non-dcgcncratc code has to map the codeword space to orthogonal 
spaces if the syndrome is to be detected unambiguously. Hence, the minimum allowable 
size for the encoding space is the product of the dimension of the codeword space and the 
number of operation elements to be corrected. It follows that n > 5 qubits are required for 
a non-degenerate Pauli basis code, in contrast to our four-qubit code. 

The lessons are that (1) better codes may be found for specific error processes, and 
(2) approximate error correction simplifies code construction and admits more codes. Ap- 
proximate error correction is a property with no analog in digital classical error correction, 
because it makes use of the slight non-orthogonality possible only between quantum states. 
We describe our approach to this problem by first exhibiting our four-qubit example code 
in detail. We then generalize our results to provide new, relaxed error correction crite- 
ria and specific procedures for decoding and recovery. We conclude by discussing possible 
extensions to our work. 

4.2.1 Four Bit Amplitude Damping Code 

Recall that single qubit amplitude damping is defined by 



^0 = ^[{i + VT^)i + {1 - VT^)Z 



(4.109) 



(4.110) 



S{p) = Yl ^kpA\ 



(4.111) 



fe=0,l 
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^0 = 


1 


Ai = 


V7 




VI -7 








(4.112) 



The probability of losing a photon, 7, is assumed to be small. To correct errors induced by 
this process, we encode one qubit using four, with the logical states 



|0l> 



1 
1 

72 



|0000) + |1111) 
|0011) + I 1100) 



A circuit for encoding the logical state is shown in Fig. 4.1 



|0) 









r 




r 




( 


3 ( 





(4.113) 
(4.114) 



Figure 4.1: Circuit for encoding a qubit. The third register contains the input qubit. 



Using the notation defined in Section [4.1.1| , the possible outcomes after amplitude damping 
may be written as 

[V'o«t>=0|</'^,>^0^fc|V'in>, (4.115) 
k k 

where k are binary strings labeling the errors (for example ^oio- 
the input state 

li^in) = a\OL) + b\lL) , 



Ao'E)Ai(g)Ao---). For 
(4.116) 



all possible final states occurring with probabilities 0(7) or above are 



i'OOOO/ 



^1000/ 



^0100/ 



|oooo) + (1-7)^11111) 

72 



+ b 



(1-7)[|0011) + 11100)] 



V2 



'7(1- 


7) 


2 


'7(1- 


7) 



a(l -7)|0111) +6|0100) 
a(l -7)|1011) +6|1000) 



(4.117) 
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0010/ 



^0001/ 



^7(1- 


7) 


2 


'7(1- 


7) 



a(l -7)|1101) +6|0001) 
a(l -7)|1110) +6|0010) 



The usual quantum error correction criteria Eq. ( p. 9] ) require (Oil^t^^lOi) = (lj;,|^t^^|li), 
but 

(Ol|4ooo^oooo|Ol) = 1-27 + 372 + 0(73) (4.118) 



( 1^1 ^0000^0000 1 1l) 



1 - 27 + 7^ 



(4.119) 



So the code we have constructed does not satisfy the usual criteria. We will demonstrate 
that the code satisfies new approximate error correction conditions later on, and revisit the 
recovery procedure afterwards. First, we exhibit how the code works. 

Decoding and Recovery Circuit 

Let us denote each of the four qubits by ni,...,n4. Error correction is performed by 
distinguishing the five possible outcomes of Eq. ( [4.117 ), and then applying the appropriate 
correction procedure. The first step is syndrome calculation, which may be done using 
the circuit shown in Fig. |4.2| A. There are three possible measurement results from the two 
meters: (M2, M4) = (0,0), (1,0) and (0,1). Conditioned on (M2, M4), recovery processes 



Wk implemented by the other three circuits of Fig. 4.2 can be applied to the output in ni 
and 71-3. 

If (M2, -M4) = (0,0), then ni and are in the state: 



|00) + (1-7)2|11) 
V2 



+ b 



:i-7)(|01) + |10)) 



V2 



(4.120) 



To regenerate the original qubit, the circuit of Fig. [4.2| B is used: a cnot is applied using 
ns as control, giving 



a|0) 



|0) + (1-7)^11) 
V2 



+ b\l) 



(1-7)(|1) + |0)) 
V2 



(4.121) 
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ni 



M-, 




(A) 



|0>- 



"3 >>- 



IVout) 



(C) 



IVout) 



- 1^/4-9 — 



(B) 



nj 

|0> 

«5 



(D) 



Figure 4.2: (A) Circuit for error syndrome detection. The measurement result is used to 
select Wk out of three actions. If the result {M2, M4) is 00, 10, or 01, circuits (B), (C), or 
(D) are applied, respectively, to recover the state. The not gate, X, is represented by the 
usual classical symbol. The angles 0, 6' are given by tan^ = (1 — 7)^ and cos^' = 1 — 7. 
The rotation gate and controlled-rotation gate specified by an angle 9 perform the functions 
exp(i6ay) and Ai(exp(i6ay)) respectively in the notation of |13]. 



ni can now be used as a control to rotate ns to be parallel to |0). We obtain as the final 
output in ni and n^: 



«\/^Mr^|0) + 6(1-7)11) 



|0) 



(l-7)(a|0)+6|l)) + 0(7^)10) |0) , 



(4.122) 
(4.123) 



with the corrected and decoded qubit left in rii as desired. 

If (M2, M4) = (1,0), the inferred state before syndrome measurement is c/'iooo or i?!>oioo- 
In either case, ni and ns are in a product state and ns is in the distorted state: 



'(1-7)7 



a(l-7)|l)+6|0) 



(4.124) 



To undo the distortion, we apply the non-unitary transformation in Fig. OC. The combined 
operation on ns due to the not gate, the controlled-rotation gate and the measurement of 
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the ancilla bit can be expressed in the operator sum representation: Af{p) = NqpNq + 
NipNl, where 



1 
1-7 



, iVi = ^^7(2-7) 





1 



(4.125) 



The A'^o s-iicl Ni operators correspond to measuring the ancilla to be in the |0) and |1) states 
respectively. If the ancilla state is |0), we obtain the state: 



1^, 



out/ 



'(1-^)3^ 



a|0) +6|1) 



(4.126) 



in the third register because Nq preferentially damps out the 6|0) component in Eq. ( 4.124 ). 
We get an error message if the ancilla is in the |1) state. Finally, if {M2, M4) = (0,1) the 
same procedure can be applied as in the (M2, M4) = (1,0) case, with ni and 77,3 swapped. 

The fidelity, defined as the worst (over all input states) possible overlap between the 
original qubit and the recovered qubit is 



.F= (1-7)2 + 4 



(l-7)'7 



1-572 + 0(7^), 



(4.127) 



Note that the final state Eq. ( 4.123 ) is slightly distorted. This occurs because the recovery 
operation is not exact, due to the failure to satisfy the code criteria exactly. Furthermore, 



the circuits in Fig. |4.2| C and 4. 2D have a finite probability for failure. However, these are 
second order problems, and do not detract from the desired fidelity order. 

4.2.2 Approximate Sufficient Conditions 

We now explain why our code works despite its violation of the usual error correction 
criteria. The reason is simple: small deviations from the criteria are allowed as long as 
they do not detract from the desired fidelity order. To make this idea mathematically 
concrete, we present a simple generalization of the usual error correction criteria described 



in Section 3.3.2. These approximate error correction criteria are sufficient conditions for 
approximate error correction. 



We first summarize the exact criteria described in Section 3.3.2, retaining all previously 
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defined notations. We consider a quantum code C which is a subspace in a larger Hilbert 
space TL, acted on by a noise process £{p) = J2neK ^nPMi- The criteria for C to correct for 
the errors G Are, or equivalently, for £'{p) = Z^neACre ^npAj^ to be reversible on C, is 
given by 

PcAiAnPc = gmnPc Vm, n G /C,e , (4.128) 

where Pc is the projector onto C, and gmn are entries of a positive matrix. It is always 
possible to rewrite £' {p) = J^neKrc ^^P^n such that 

PcAlAnPc = PnSn^nPc Vm, n G JCre , (4.129) 

where pn are non-negative c-numbers. Without loss of generality, we will use Eq. ( [4.12S| ) 
and omit the tilde to simplify notations. 

The sufficiency of Eq. ( p^ can be established as follows. When Eq. ([4. 129]) is satisfied, 



the An operators have polar decompositions 

AnPc = VP^UnPc Vn G ICre , (4.130) 

where the U^s are unitary and PcU\UmPc = ^nmPc- The recovery operation TZ is defined 



as 



n{p)= RkpRl + PEpPE, (4.131) 

k£Kre 

where Rk = PcUl is the appropriate reversal process for each A^ G Are- When we apply TZ 



to £{p), the first term in Eq. ( 4.13l| ) is given by P'^'^^p where P*^*^* = J^nefCreP"- total 
detection probability for the reversible subset. P"^*^* is a lower bound for the fidelity J- for 
pure input states. ^ 



Now we generalize Eq. ( [4.129| )- ([4.130 ) based on the following assumption: the error 



is parametrized by certain small quantities with physical origins such as the strength and 
duration of the coupling between the system and the environment. For simplicity, we 
consider only one-parameter processes, and let e be the small parameter. For example, e 



^ The fidelity can be generalized to mixed input states f = ^1^{PoutPinPout) ) which is minimized at pure 
input states. Therefore, P'''=* lower bounds the fidelity for all input states. 
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can be the single qubit error probability. Suppose the aim is to find a code for a known £ 
with fidelity: 

T>1- 0(e*+^) . (4.132) 

In the new criteria, it is still necessary that P'^'^'' > that is, Are has to include all An 
with large detection probability max|^^^^g(j Tr(|V'm)(^m|^n^n) ~ C'(e*), s < t. However, 
it is not necessary to recover the exact input state; only a good overlap between the input 
and output states is needed. In terms of the condition on the codeword space, it suffices 
for the An to be approximately unitary and mutually orthogonal on C. These observations 
can be expressed as relaxed sufficient conditions for error correction. Suppose 



AnPc = Un^Pc^nAnPc , (4.133) 

is a polar decomposition for An- We define c-numbers pn and A„ so that p„ and Pn^n are 
the largest and the smallest eigenvalues of PcAl^AnPc, considered as an operator on C. 
The relaxed conditions for error correction are that: 

Pn(l-A„) < 0(e*+i) VnG/C,e (4.134) 

PcUlUnPc = SmnPc- (4.135) 

Note that when = 1, Eqs. ( [4.133| )-( [4.135D reduce to the exact criteria. In the approximate 
case, P*^*^* = J2n£K.re Ti'dV'm) (V'ml^n^n) IS uot a Constant, but depends on the input state 
\ipin)- Since Are includes enough errors so that P'^'^'' > 1 — 0(e*^^), when Eq. (4.134) is 
satisfied, we also have 'EneKrePn > 1 " 0{e^+^) and 'EneJCrePn^n > 1 - 0(6*+^). 

We now prove that J2n£K.re Pn^n is a lower bound on the fidelity. Defining the residue 
operator 



7r„ = ^PcAlAnPc - VpJ^Pc , (4.136) 



we find, for the operator norm of tt^j, 



< |vr„| < - a/p„A„ , (4.137) 
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and 

AnPc = UniVKKI + ^n)Pc ■ (4.138) 

The sufficiency of our conditions to obtain the desired fidelity may be proved as follows. 
Though Eq. ( 4.13C| ) is not satisfied, as long as Eq. ( 4.135| ) is true, we can still define the 



approximate recovery operation 

n{p)= J2 RkpRl + PEpPE (4.139) 

with Rk = PcUl being the approximate recovery operation for A^, and Pe is as defined 
in the case of exact error correction. For a pure input state \ipin){ipin\, applying TZ on 
£ i\ipin) ii^inl) , and ignoring the last term which is positive definite produces an output with 
fidelity 

= min Tr I |-0m)(-0m| ^(^^(|V'm)('0m|)) 

> min Yl \{i'^n\ulAr,\i^,n)\^ . (4.140) 



Omitting all terms for which k ^ n and applying Eq. (4.138) gives 



JT > min ^ \{^Pin\VPnK + -^n\tpin)\'^ > Pn^n , (4.141) 



where in the last step, we have used Eq. ( 4.137| ). Hence, the fidelity is at least X^neACre Pn^n > 
1 — ©(e*"*"^) and the desired fidelity order is achieved as claimed. 

An explicit procedure for performing this recovery is as follows. First, a measurement 
of the projectors Pk = UkPcUl is performed. Conditioned on the measurement result, k, 
the unitary operator Uk is applied to complete the recovery. 

4.2.3 4-bit Code Revisited 



In terms of the approximate quantum error correction criteria Eqs. ( 4.134| )-( |4.135| ), we may 



understand why our four-bit amplitude damping quantum code works as follows. We present 
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matrices with respect to the orthonormal basis ordered as: 



|0000), |0011), |1100), |1111), |0111), |0100), . 



(4.142) 



The projection operator |0/,)(0£,| + onto the codeword space C is 



Pc = -, 



10 1 

110 
110 
10 1 



(4.143) 



where the irrelevant null space is omitted. We are interested in the restriction of to C, 
therefore we exhibit rows and columns in Ak that have nontrivial contributions to A^Pc- 
The operation element corresponding to no loss to the environment is 



A 



0000 



10 

1-7 

1-7 

(1-7)^ 



(4.144) 



The eigenvalues of Pc^oooo^oooo-Pc are (1 — 7)^ and ^(1 + (1 — 7)^). Interested readers can 
check for themselves that 



^oooo^b = t^oooo (1 - 7K + (7 + C(7 ))7roooo Pc 



(the order of 7 in ttoooo is factored out of ttoooo) with the choice: 



cos(0-f) -sin(0-5) 

10 

1 

sin(e-f) cos(e-f) 



0000 



(4.145) 



(4.146) 
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TTOOOO 



10 





1 



(4.147) 



where tan^ = (1 — 7)^, and only the nontrivial restriction to C is exhibited. The exact 
quantum error correction criteria are not satisfied, as -Pc ^0000 ^0000 -Pc has different eigen- 
values. However, the difference is of order 0(7^) and thus the relaxed condition Eq. ( [4.134| ) 
is satisfied. 

For the error which describes losing a quantum from rii, we have 



A 



1000 



(1-7)2^ 




















































































1-7 














1 












(4.148) 



The eigenvalues of -Pc^iooo^iooo-Pc are 7(1 — 7) and 7(1 — 7)'^. The difference is (27^ 



7^)(1 — 7). We have the decomposition 



A 



1000-rc 



'(1-7)7 



1000 



^1 - 7)/ + 77fi000 



(4.149) 



U 



1000 



1 




















1 





























1 














1 














1 














1 












1 

V2 



1 
V2 
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1 


V2 


V2 


1 
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V2 


V2 





















-75 






1 

V2 




10 



1 
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TTlOOO 











V2 







(4.150) 



0^000 





1 



Other one loss cases are similar. For different k, the U^Pc matrices have non-zero entries in 
different rows, and are orthogonal to each other. Hence, all the approximate code criteria 
are satisfied. Using these explicit matrices, we obtain 



n{£{\ij,n){^in\)) 



J2 PcUlAk\lP^n){i^in\AlUkPc 

(l-372)|V»„)(Vm| + ... . 



(4.151) 



The fidelity is thus at least 1 — 37'^, and is of the desired order. 



The recovery procedure suggested in Eq. (4.151) contrasts with the decoding and re- 



covery circuits in Section 4.2.1. It is an interesting exercise to check that the composition 



of the operations in Fig. |4.2[A and 4.2B, followed by re-encoding the recovered qubit has 



the same effect on C as applying Uqqqq for recovery. For the case in which an emission 



occurs in the first qubit, the composition of operations in Fig. 4.2A and |4.2| C, followed by 
re-encoding the recovered qubit has the same effect on C as preferentially damp out the 
Ins) = |0) component followed by applying C/xooo ^'^^ recovery. Note that it costs 27^ in the 
fidelity to remove the distortion. 



4.2.4 Applications to other codes 

Our approximate criteria may also be used to simplify code construction using a non-Pauli 
error basis. For example, consider the bosonic codes in Sections 4.1.2- 4^1^^ . For logical 
states Ici), |c2), . . . of the form: 



/ii|niini2 ■■■riir. 
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+ V/i2h2in22 • • • n2m) 
+ ■■■ 

+ yJWh\nNiinNi2 ■ ■ ■ riNim) , (4.152) 

the original non-deformation conditions for correcting up to one loss of quantum require 
the following to be constant for all logical states: 

Ni 

(q|4^o|Q) = (4-153) 

i=l 
N, 

(q|4...i...oAo...i...o|q) = 5](l-7)'^^'~'7/"i^u- (4.154) 

i=l 

In the above, RSi = jyjLi ^^ij is the total number of quanta in the ith QCS in |q). We have 



seen that it is difficult to find a solution for Eq. ( 4.154| ) when RSi is not constant for all i. 



With the new criteria, it suffices for the following to be constant for all logical states: 

Ni 

^^mriij Mj. (4.155) 

i=l 

That is, the equality of the excitation number in all QCS in all logical states is relaxed to 
the equality of the average number of excitations over the QCS in each codeword. This 
provides an alternative explanation of why the five-bit code 

|0l) = lOOOOO) + IllOOO) - llOOll) - lOllll) 

+ IllOlO) + lOOllO) + lOllOl) + IIOIOI) 

|1l) = llllll) - lOOOll) + lOllOO) - llOOOO) 

- lOOlOl) + IllOlO) + llOOlO) - lOlOlO) (4.156) 



can correct for one amplitude damping error (as described in Eq. ( 4.112| )): although the 
code does not satisfy the exact non-deformation criteria Eq. ([4.154 ) for the non-Pauli error 
basis, it satisfies the approximate ones leading to Eq. ( [1.155|) . 
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4.2.5 Stabilizer description for approximate error correction 

Gottesman has subsequently given an explanation of approximate error correction of am- 



plitude damping using the stabilizer formalism [56|. The amplitude damping operators Aq 
and Ai can be expressed in other error bases: 

^0 = /+ ^~^^^ (/-^) = /+ ^~^f^ i3 (4.157) 
A, = ^X{1-Z) = ^A (4.158) 

where B = [I — Z) and A = X(I — Z). The elements of the reversible set are tensor products 
of /, A, and B with low weights. To correct for t amplitude damping errors, terms up to 
order 7* are relevant. Therefore, we need to consider E'^F for all E,F in the reversible set 
that contain r factors of A and s factors of B where | + s < t. 

It is sufficient to use CSS codes with the X generators capable of detecting t Z errors 
and the Z generators correcting t X errors. If E"^ F contains at least one A, it anticommutes 
with some Z generators (the Z factor in A commutes with the Z generator and does not 
affect the argument). If E^^F has no A at all, it can have up to t and therefore up to t Z 
which anticommutes with some X generator. Comparing with correcting t general errors, 
the requirement of correcting t Z errors is reduced to detecting t errors only. As an example, 
the following code 

X X X X X X X 
Z Z Z Z I I I 

(4.159) 

Z Z I I Z Z I 
Z I Z I Z I z 

can correct for one amplitude damping error and encode 3 qubits. 

It is interesting to note that the above construction which takes into account the ap- 
proximate criteria and only detects Z errors still does not admit a 4 bit code. This can 
be proved by elimination. First of all, no such 3 bit codes exist, as the minimal set of 
generators is {XXX,ZZI,IZZ}. To eliminate any 4 bit code, it suffices to show that at 
least 3 Z generators are required for correcting up to one X error in four. The minimum 
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weight of the Z stabihzers can either be 1 or 2. If the minimum weight is 1, without loss of 
generahty, ZIII is a generator. It takes at least 2 other Z generators to handle X errors 
in the last three qubits. If the minimum weight is 2, and ZZII is a generator, it is obvious 
that no single Z generator anticommutes with all of I IX I, IIIX and IIXX. The lesson 
is, this construction using specific error model and approximate error correction is still not 
optimal, since the error description using X and Z does not exploit special properties of 
the Aq, Ai operators. 

Consider the four-bit code again. It is in fact the 4 bit CSS code we have ruled out: 

Ml = X X X X 
M2 = Z Z I I 
Ms = I I Z Z 
X = X X I I 
'Z = Z I Z I 

Recall that the difficulty is that no stabilizer anticommutes with XXII and IIXX, so that 
the error correction criteria cannot be established for A'^ AH and II A. However, anti- 
commutivity is not required to establish the criteria. In fact, A^ AH x ZZII = —A^ AH, 
therefore, PqA^ AIIPq = —PcA^AIIPc = establishing the error correction criteria with- 
out anticommutivity. More precisely, E'^F can be negated by multiplication by a stabilizer 
(instead of by conjugation). This is unique to a non-Pauli basis, since no stabilizer can 
negate E^F in the Pauli basis by multiplication. This can be generalized, and an imme- 
diate application is the delightful result that the Shor code can correct for two amplitude 
damping errors. It is trivial that the two X generators can detect up to any two Z errors. 
It remains to consider E^F with up to 4 factors of A. Almost all of them anticommute with 
some Z generators except for AAA^ IIIIII (and others with exactly one or two A\ and 
similar ones in the other two blocks) . This is negated by the stabilizer IZZIIIIII. Similar 
arguments apply to all other cases. 

This result is very remarkable: such a binary code correcting for 2 losses is very difficult 
to find in the first place and is also very lengthy to verify directly without the stabilizer 
description, and is impossible with the Pauli error basis. 



(4.160) 
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4.3 Summary 

We have constructed amplitude damping codes in bosonic and qubit systems which have 
rates better than any general codes: 



|0l) = ^ [|04) + |40) 



lU) = |22) 



• |0l) = -^[|0000) + 11111)] |li) = ^ 10011) + 11100) 



These examples demonstrate that choosing an appropriate error basis can potentially reduce 
the requirements in coding schemes. We also suggest an approximate method to enable code 
construction without the Pauli basis to be done more easily. Approximate error correction 
is particularly interesting because it is a property with no analog in digital classical error 
correction, as it makes use of slight non-orthogonality possible only between quantum states. 
It also extends the current scope of quantum error correction, which is closely related to 
digital classical error correction. 



Chapter 5 



Fault-tolerant logic gates 



5.1 Motivation 

We have seen in Section |3.4| that, to robustly perform quantum computation in the presence 
of noise, one needs to perform quantum gates and measurements directly on encoded states 
in a fault-tolerant manner. These fault-tolerant quantum gates and measurements must 
prevent a single error from propagating to more than one error in any code block, so that 
small correctable errors will not grow to exceed the correction capability of the code. This 
requirement greatly restricts the types of unitary operations which can be performed on the 
encoded qubits. We have seen how the Clifford group operations can be performed bitwise 
on CSS codes, and have seen how fault-tolerant measurements of certain class of operators 
can be performed. (These can also be performed on any stabilizer codes.) However, to 
obtain a universal set of gates, at least one additional gate has to be constructed, such as 
the Toffoli gate or the vr/S gate. The first construction of such gates was reported in [104|, 



and was followed by many others |11, 68, 57, ^]. Unfortunately, these constructions 
are given, but not systematically derived. Thus they cannot be easily generalized. The 
construction can also be complicated. This point can be better appreciated by considering 



the circuits for performing the Toffoli gate and the vr/S gate (denoted by T) in Fig. 5T 



101 



102 



CHAPTER 5. FAULT-TOLERANT LOGIC GATES 



(a) 



(b) 



\a) 



x) ^ 



X 



\a) 



\y) 

\z S) xy) 



PX 



— 



Figure 5.1: Fault-tolerant implementation of the Toffoli gate and the vr/S gate. \a) denotes 
the corresponding special anciha states required. \a) = (|000) + |010) + |100) + |lll))/2 for 
the Toffoh gate and |a) = ^(|0) + e*''/'^|l)) for the tt/8 gate. 

While the validity of these circuits can be directly verified, it is not apparent how they 
are constructed. Furthermore, all known constructions Q share some intriguing similari- 
ties which are not understood. They all involve some special ancilla states, and special 
operations conditioned on some measurement outcomes. 

It was first pointed out by Shor |104] that the use of operations conditioned on classi- 



cal measurement outcomes is reminiscent to teleportation [16]. In teleportation, quantum 
states are sent using pre-shared EPR states, measurements made by the sender, classical 
communication performed, and operations made by the receiver conditioned on the mea- 
surement outcomes. Such connection between teleportation and fault tolerant gates were 
not understood, until teleportation is used as a basic primitive to systematically construct 
many fault tolerant gates in |^^. However, the resulting circuits are not as simple as prior 
ad-hoc constructions. 

In this chapter, we uses a simpler primitive dubbed "one-bit-teleportation" to construct 
fault tolerant gates. This provides a systematic and unified construction of a large class of 
important fault-tolerant gates which parallel the simplest existing schemes and improve on 
many others. 



We describe our basic primitive, one-bit teleportation, in Section 5.2. Its application to 



fault-tolerant gate construction is presented in Section 5.3, which is followed in Section 5.4 



^ Except for the polynomial codes and toric codes 
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with specific circuits for the vr/S, controhed-phase, and TofFoli gates. We summarize our 



results in Section 5.7. 



5.2 One-bit teleportation 

We consider a class of "one-bit-teleportation" circuits, each of which is essentially the fol- 
lowing swap circuit for two qubits: 



|0). 



|0) 

1^) 



(5.1) 



Throughout this section, the first and second qubits refer to the registers with respective 
initial states |0) and IV')- Since X = HZH, Eq. is equal to: 



|0)^ 



1^) 



^ H 



H 



0) 



(5.2) 



Since measurement commutes with a controlled-quantum-gate when the control qubit is 
being measured pO| , we have 



U 



u ■ 



(5.3) 



where all notations are as defined in Section |2.2| . Operations which are performed con- 
ditioned on classical measurement results are called classically-controlled operations. In 
Eq. (^.2|), the two qubits are disentangled before the second Hadamard gate. Therefore, the 
second qubit can be measured without affecting the unknown state of the first qubit. Using 
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Eq. (O), Eq. (pi) is equal to 



IV') 



H 



Applying Eq. (U) to U^^p), we obtain the left-hand-side of the following: 



(5.4) 



|0) 



Z — U 1^) 



|0> 



<s^u — uzu^ — IV') 



In particular, when U = H, the right-hand-side of Eq. (5.5) is equal to 



(5.5) 



\0) — H 



X — IV) 



(5.6) 

The circuits in Eqs. ( ^.4| ) and ( |5.6| ) are referred to as "Z-teleportation" and "X-teleportation" 
after the classically-controlled-operation involved. X and Z-teleportation circuits can both 
be represented using the same general structure: 



|0) 

IV') 



A 



E 



D 



B 



(5.7) 



where for Z-teleportation, A = I,B = H,D = Z, and E is a, single CNOT with the first 
qubit as its target. For X-teleportation, A = H,B = I,D = X, and is a single cnot 
with the first qubit as its control. 
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5.3 Fault-tolerant gate constructions using one-bit telepor- 
t at ion 

In this section, we develop a general method for fault-tolerant gate construction using 
one-bit teleportation as a basic primitive. We will confine our attention to the self-dual 



doubly-even CSS codes |23, 108], although the results can be extended to any stabilizer 
code 



5.3.1 Fault-tolerant gate hierarchy 

We first summarize the fault-tolerant gate hierarchy introduced in [^ ]. Let Ci denote the 
Pauli group. C2, the Clifford group, is the set of gates which map Pauli operators into Pauli 
operators under conjugation. We can recursively define an infinite class of quantum gates 
as 

Ck = {U\UCiU^ C Ck^^} , (5.8) 

for k > 2. For every k, Ck-i C Ck is strictly increasing. 

It is now easy to understand what is involved in all existing schemes to complete the 
universality requirement. In all cases, one gate in the set difference, C3 — C2, is added 
to C2 to form a universal set. These include the 7r/8 gate T ||2l|, (T|x) = e*'^^/^|a;) for 
X G {0, 1}), the controlled-phase gate CP ||6^] (cp|xy) = i^'y\xy) for x,y = {0, 1}) and the 



Toffoli gate |104| . We will see in the next few sections that there is a reduction which allows 
the Ck gates to be recursively constructed from performing and measuring Ck-i gates. 
This reduction is what makes it possible to complete the universality requirement out of 
the available C2 primitives. 

5.3.2 C3 gate construction using one-bit teleportation 

We now consider a general method to perform C3 gates fault-tolerantly using the one-bit 
teleportation scheme as a primitive. Let U £ C3 he an n-qubit gate, to be applied to \tp), 
an encoded quantum state with n logical qubits. One can first teleport and then apply 
U to the teleported version of {ip) (why this is useful will be explained shortly). Each logical 
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qubit can be teleported using either Z-teleportation or X-teleportation given by Eq. ( p7i 
We call the gates in Eq. (5.7) Ai, Bi, Ei and for the teleportation of the i-th qubit. 
We relabel the tensor product Ai <^ ■ ■ ■ <^ A^ as A, and similarly for B, D and E. This is 
summarized in Eq. (|5.g|): 



|0)«5" -i^A_ 



E 



D 



U 



B 



(5.9) 

In Eq. ( |5.9| ), the symbol "/"" represents a bundle of n logical qubits. The measurement 
box outputs an n-bit classical outcome represented by the double line. The z-th classical bit 
controls whether Di is performed on the i-th logical state in the first register. For simplicity, 
we draw a box with label D where the double line ends. If Z-teleportation is applied to the 
i-th logical qubit, Ai = I,Bi = H, Di = Z, and Ei is a cnot with the i-th qubit in the first 
register as target; if X-teleportation is applied instead, Ai = H,Bi = I,Di = X, and Ei is 
a CNOT with the i-th qubit in the first register as control. 

The usefulness of the extra teleportation of \tp) before applying U is the following. We 
can commute U backwards in time, so that U acts on the known ancilla state ^|0)®". 
Commuting U with each classically-controlled operation Di changes Di to UDiU^ G C2 
as Di S Ci. For simplicity, we represent the combined controlled-operation as UDW in 
Eq. ( 5.10| ). Likewise, commuting U with E changes E to UEW. ^ To ensure UEU"^ is 
in C2 for an arbitrary U G C3, we consider U and E such that [f/, -E] = 0, in which case, 
UEU'^ = E eC2. Then Eq. (O) becomes 



|0)®n 



U 



UDU^ 



U\'^) 



B 



(5.10) 

All the circuit elements outside the dotted box can be implemented fault-tolerantly. Inside 
the dotted box, we need not apply U to yljO)®"; instead, we can create the logical state 

^ We write UEU'' instead of {U ® I®")E{U'' (g) 7®") for simplicity. Unimportant identity operators are 
likewise suppressed throughout the discussion. 
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U A|0)®" directly. As lO)*^" has stabilizers Zi (i = 1, . . . ,n), where Zi is the encode Z acting 
on the i-th encoded qubit, the stabihzers of UA\0)'^'' are UAZiA^U^ = UAiZiAju'' = 
UDiU^ (following the discussion after Eq. (U)). As UDiU^ G C2 (which can be performed 
bitwise in CSS codes) and have eigenvalues ±1, these can be fault-tolerantly measured 
using the method described in Section 3.4. Measuring UDiU^ Vi allows C/A|0)®" to be 
prepared from any convenient state. Therefore, a beautiful reduction is obtained, from 
performing U € C3 to performing and measuring UDiW which is in C2. This completes 
the discussion on how to perform U fault-tolerantly on any encoded state, provided U G C3 
and teleportation circuits can be found to satisfy [U, E] = 0. 

The above construction can be used to systematically construct interesting gates in 
C3 — C2- We now exhibit the examples of the vr/S gate, the controlled-phase gate and the 
Toffoli gate. Any one of these gates, together with C2, form a universal set of gates. 



5.4 Examples 
5.4.1 The tt/S gate 

The 7r/8 gate, T, has the matrix representation 



1 

e*''/^ 



(5.11) 



Note that T is diagonal and commutes with the cnot in the X-teleportation circuit. There- 
fore, we choose to apply X teleportation to \ip) and apply T to the teleported {tp): 



|0> 



X 



T\'iP) 



We commute T backwards to obtain 



|0) 



H 



W 



T|V) 



^ — Ef^ 
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where W = TXT^f = yfiPX. It remains to prepare the ancilla state, 

1 0) +6*^^/4 1 1) 



TH\Q) 



V2 



(5.12) 



with stabilizer W . We can prepare any convenient state, such as the logical |0), by measuring 
Z fault-tolerantly. We replace Z hy W in the stabilizer by measuring W and "fixing" by 
applying Z if we obtain a —1 eigenstate of W (as {Z,W} = 0) (see Section |3.4| ). This 
completes the scheme to perform the encoded version of T fault-tolerantly. We remark that 
we obtain the same scheme as in 

5.4.2 The controUed-phase gate 



The controlled-phase gate, CP (defined in Section 5.3.1 ), acts on basis states according 
to cp\xy) = i^'y\xy). CP G C3, and together with H and cnot, form a universal set of 
gates p9|, pM]. We use the following circuit symbol for CP: 



(5.13) 



CP commutes with Zi and it acts on Xj (i = 1, 2) as follows: 



X 





X 












z — 


p 





(5.14) 



To perform CP, we teleport IV') and apply CP. We apply X-teleportation to both qubits 
such that the cnots in the circuit commute with CP. As the circuit acts coherently on the 
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input, it suffices to consider an arbitrary basis state \xy). 



\0)-H 



|0>^ 

\x) ^ 

\y) 



X 



X 



(5.15) 



Commuting CP backwards using the commutation rules in Eq. ( |5.14| ), we obtain a circuit 
to perform CP 



\0) — H f- 



\x) 



X 



X 



- \x) 



z — P 



« — pf^ 



(5.16) 

where the double lines control all the operations in the corresponding boxes. It remains to 
prepare the special ancilla state \a): 



H 










H 


— 1 


D 



a) 



with stabilizers Wi = CPXjCP^ for i = 1,2. We can conveniently prepare the state \a'): 



io) Y 

io) -^wy^ 



\a') 



The stabilizer of \a') can be generated by Zi and X2 since CP has no effect when the first 
bit is |0). Therefore, \a') can easily be prepared by measuring Zi and X2. On the other 
hand, the stabilizer of |a') is generated by Zi = cpZiCP^^ and W2 = CpX2CP^. Therefore, 
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we only need to replace Zi by Wi, by measuring Wi to obtain \a). This is possible because 
{Wi,Zi} = and [Wi,VF2] = by construction. This ancilla completes the fault-tolerant 
construction of the controlled-phase gate. 



5.4.3 The Toffoli gate 

To construct the Toffoli gate (controUed-controlled-NOT) , we begin with some useful com- 
mutation rules: 



X 



m- 







X - 






— ( 


) — a 


^ — 



(5.17) 



(5.18) 



Since the Toffoli gate is diagonalized by a Hadamard transform on the target bit, using X 
(Z) teleportation for the control (target) bit ensures the three CNOTs to commute with the 
Toffoli gate. 



|0)- 
|0) 
|0) 
|x>- 

\y) 



X 



X 



\x) 

\y) 

\z ffi xy) 



(5.19) 



'e\ — S 
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Commuting the Toffoli gate backwards using Eqs. (|5T^)-( |5l8| ), Eq. (|5l9| ) is equivalent to 



-a 



It remains to prepare the ancilla \a): 



|o)- 

|0)- 



\a) 



\y) 

\z a;?/) 



(5.20) 



with stabihzers Wi = Xi (g) CNOT23, W2 = -^2 ® CNOT13 and W3 = Z3 CZ12, where CZ 
denotes a controlled-Z, and the ordered subscripts for cnot and CZ specify the control and 
target bits. We start with the state \a'): 



|0) 

|0) 
|0) 



H 






H 




H 







\a') 



on which the Toffoli gate has no effect. Therefore \a') has stabilizers generated by Xi for 
i = 1,2,3 and can easily be prepared. At the same time, the generators of the stabilizer 
can be given by Wi, W2 and X3. Again, X3 and W3 anticommute due to the addition of 
H in the third qubit, but the Wi are mutually commuting. Hence, \a) can be prepared by 
replacing X3 by W3 in the stabilizer by measurement of W3. This completes the ancilla 
preparation and therefore the gate construction. 

Note that the ancilla and the quantum circuit derived are the same as those in Shor's 
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original construction ]104[ . The one-bit teleportation scheme elucidates the choice of the 
ancilla state and the procedure in [104|. 



5.5 Recursive construction 

In this section, we discuss what gates can be constructed with one-bit teleportation as a 
primitive. We extend our discussion to gates in and characterize a class of gates which 
can be recursively constructed. 

We will prove by induction that the diagonal subset of C^, defined by -F^ = {U G 
Cfc and U is diagonal}, can be recursively constructed. First of all, for U £ F/^, we apply 
X-teleportation to each logical qubit to ensure commutivity with the cnots. Second, for 



U e Fk, and D e Ci, UDW = UD where U e Fk-i If the gates in Fk-i can be 

performed, the classically-controlled operator UDU^ for U ^ F^ can also be performed. 
Third, it can be proved by induction that UA\0)^"' can be prepared fault-tolerantly [^]. |^ 
Finally, the gates in F2 C C2 have transversal implementation. By induction, all the gates 
in Fk can be performed fault-tolerantly by a recursive construction. 

The sets F^ contain many interesting gates, such as the single qubit 7r/2^ rotations 
V'' = Diag(l, e*'^/^'') and the controlled-F'^^-'^ = Diag(l, 1, 1, e*'^/^'' ^), which are used in the 



quantum Fourier transform circuit |103, |3^ essential to Shor's factoring algorithm [103|. 



Fk also includes the multiple-qubit gates A„(y') for n -1- / < A; |54], where A„(y') applies 
to the (n -|- l)-th qubit if and only if the first n qubits are all in the state |1). By the 
closure property of Fj. [54], all products of A„(y') for n + I < k are in F^- For small k, 
recursive construction can be more efficient than approximating these gates to an equal 
accuracy using a universal set of fault tolerant quantum logic gates. 

The gates in F^ are not the only ones which can be constructed using the one-bit 
teleportation scheme. If U G C3 is related to an element in F3 by conjugation by Hadamard 
gates in the ii-th, . . ., ii-th. qubits, one can apply X teleportation to those qubits and Z 

teleportation to the rest to ensure [U, E] = 0. The Toffoli gate is an example. More generally, 

^ Since states with stabilizers in Fk can be prepared if cat-state-controlled- _Ffe operations can be performed, 
it suffices to prove the latter. For the induction step, suppose the cat-state-controlled- _Ffe_i operations can 
be performed, one can also prepare states with stabilizers in Fk-i- Together, they can be used to perform 
any cat-state-controlled- -Ffe operation, completing the induction step. The base case F2 is true. 
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C3 gates in the form U = GbVGa for y G F3 and Ga,Gb G C2 can be performed simply by 
performing Ga,Gh directly before and after the scheme for V. The controlled-Hadamard 
gate is an example. 

Note that so far, we only give sufficient conditions for a gate to be implemented by 
one-bit teleportation. The necessary conditions are intriguing, but due to our present lack 
of knowledge about the Ck gates, they are far from being understood. 



5.6 Extensions 

We briefly mention two other applications of the one-bit teleportation circuits beyond fault- 
tolerant gate constructions. 

Hybrid circuits 

The one-bit teleportation circuits can be used to convert data types. For example, the 
data type can be unencoded qubits, qubits encoded with various codes or some cat-states. 
Consider Eqs. (5.4) and (^^) again. They can convert the input data type to the data 



type of the ancilla, if the format of the operations are chosen properly. One immediate 
application is to encode qubits. Another application is to change some encoded data from 
a code Ci to another C2 without ever decoding the data. 

Distributed computation 

The current construction is also well suited to explain distributed computation, in which 
certain non-local operations are not allowed, and have to be performed using pre-shared 
entangled states and classical communications. Teleportation is one such protocol. More 



can be found in |115]. 



5.7 Conclusion 

We have presented a systematic technique to construct a variety of quantum operations, 
by using a primitive one-bit teleportation scheme to reduce difficult gate constructions to 
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measurements and ancilla state preparations. We applied this technique to fault-tolerant 
quantum computation, and have demonstrated simple derivations of the tt/S, controlled- 
phase, and Toffoli gates, and have gained better understanding of how C3 gates are made 
possible. These constructions are easily generalized to realize an infinite family of gates. 
The possibility to directly construct gates outside C2 without going through the universality 
argument can bring large reductions in the resources required for quantum computation. 
Clearly, this means that one-bit teleportation may be useful for designing and optimizing 



computation and communication protocols |35, 37|. Even more intriguing, perhaps, is that 
this result gives us a first glimpse at what might someday be a standard architecture for 
a quantum computer: a simple assembly of one-bit teleportation primitives, capable of 
universal quantum computation on quantum data, given the assistance of standard quan- 
tum states which are obtained as commercial resources. Definition of such an architecture 
could be pivotal in the development of this field, much as the von Neumann or Harvard 
architectures [E^] were important in classical computation. 



Part III 

NMR Quantum Computation 



115 



Chapter 6 

Theory of NMR Quantum 
Computation 

6.1 Introduction 

In this part of the Dissertation, we turn to the more difficult issue of reaUzing quantum 
information processing. The obstacle to building such devices is more than just decoher- 
ence, which can at least be overcome theoretically. The problem is the many simultaneous 
requirements of quantum information processing which contradict each other. It is nec- 
essary that macroscopically controlled measurements and manipulations can be made on 
microscopic quantum degrees of freedom, to accomplish certain basic tasks: (1) prepare 
the system in a fiducial initial state (a pure state such as the ground state), (2) perform a 
universal set of logic gates, (3) implement projective measurement to read out the compu- 
tational results and (4) maintain long coherence times. Many candidate implementations 
have been proposed, but very few have advanced sufficiently to experimentally demonstrate 
a multi-qubit logic gate. Most experimental problems are believed to be technological, and 
will be resolved in the future. However, it is highly desirable to have an immediately accessi- 
ble system to put our theories to test and to motivate efforts in potentially useful problems. 
Solution NMR quantum computation at room temperature was proposed in late 1996 as 
such a possibility (by Gershenfeld and Chuang [^] and independently by Cory et al [pO[). 



117 



118 



CHAPTER 6. THEORY OF NMR QUANTUM COMPUTATION 



Solution NMR is a limited computation model, yet it allows a tradeoff between resources 
and computation power. Implementation is relatively straightforward for a system of up to 
10-20 qubits, and systems of several qubits are immediately available. Moreover, problems 
in NMR are well characterized and therefore present well-defined and meaningful challenges 
to the community. (The problems in most other implementations are not yet understood 
or identified, due to the meager experimental data. Ironically, NMR quantum computation 
has received more skepticism because the problems are better understood.) 

In this part of the Dissertation, some of the theoretical problems in solution NMR quan- 
tum computation are discussed and resolved. It also describes some interesting experiments. 



This chapter describes the original proposal in |53| as well as contributions from this Disser- 
tation that are extensions of the original proposal. Chapters ^ and ^ present two primary 
results of this Dissertation related to the subject. 

6.2 Primitives in NMR quantum computation 

Nuclear spin systems are good candidates for quantum computers for many reasons. Nu- 
clear spins can have long coherence times. Manipulation by complex sequences of RF 
pulses can be carried out easily using modern spectrometers. Coupled operations are built 
in as coupling of spins within molecules. However, the signal from a single spin is too 
weak to be detected using current technology. Bulk NMR quantum computation addresses 
the detection problem by using a bulk sample of identical and independent spin systems, 
such as molecules in solution. These identical systems (quantum computers) run the same 
macroscopically defined quantum algorithm in classical parallelism. However, projective 
measurement of each individual system is impossible. Moreover, a pure initial state cannot 
be easily prepared. The breakthrough in ||5^, 40 1 is to realize that computation can be per- 



formed at room temperature starting with thermal initial states by utilizing the signal of 
the small excess ground state population resembling that of a pure state. Moreover, existing 
quantum algorithms can be modified to use ensemble measurements. These observations 
simplify the implementation to the extent that it becomes immediately realizable. The 
rest of this section describes logic gates, measurements and initial states in NMR. How the 
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small excess population evolves is described in Section E^. Adaptations to enable quantum 



process tomography are described in Section 6.4. How to manipulate the small excess pop' 



ulation to compute properly is described in Section A short review in signal strength 
and scaling is given in Section Finally, the possibility to directly compute on thermal 
inputs will be discussed in Section 6/7 . We aim to develop the background for Chapters ^ 



and ^ as well as to elucidate limitations in NMR and methods to resolve them. 
6.2.1 The quantum system (hardware) 

We shall consider a physical system which consists of a solution of identical molecules. 
Each molecule has n magnetically inequivalent nuclear spins which serve as qubits. A static 
magnetic field is applied externally along the +z direction. This magnetic field splits the 
energy levels of the spin states aligned with and against the external field. Let |0) and |1) 
be the ground and excited states. The Hamiltonian for the Zeeman splitting is given in the 
energy eigenbasis as: 

Wz = -^E'^i^^^ (6.1) 
(' 

where i is the spin index, iVi/lir is the Zeeman frequency for the i-th spin, and Z^*-* is the 
usual notation for acting on the i-th spin. The convention h = 1 is used. We assume the 
spins have very different Zeeman frequencies, a situation loosely termed as "heteronuclear" . 
Nuclear spins can interact via the dipolar coupling or the indirect coupling mediated by 



electrons [10, 107 1. If the molecules tumble fast and isotropically in the solution, dipolar 
coupling and the tensor part of the indirect coupling will be averaged away. In any case, in 
the presence of a strong external magnetic field, only the secular part (the energy conserving 



terms which commute with Tiz) is important [IQ, 107]. For a heteronuclear system, the 



resulting coupling (known as the J coupling) takes the form 



i<j 



which is independent of the exact original coupling. In Eq.(|6.2|), gij denotes the coupling 
constant between the i-th and the j-th spins. We also write gij = because the spectral 
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lines of the i-th and the j-th spins are spht by Jij Hz. 

Combining the Zeeman and couphng terms, the reduced Hamiltonian for our system is 



well approximated by |1C, 107 ] 



Ti = Tiz + 'He + He 



(6.3) 



where Tienv represents coupling to the reservoir, such as interactions with other nuclei, and 
higher order terms in the spin-spin coupling. 



As an example, the energy diagram for the two-qubit case is shown in Fig. 6.1 

|1 1 ) ^ 4 ((O3 + tt)(,)/2 + 71J/2 E 

jbhigh ^high 
|1 0) - -▼- [ -- (CO3 - c0;,)/2 . 



loo): 



low 



(-«a+«b)/2-7rJ/2 

(-«'a-®b)/2 +71J/2 



Figure 6.1: Energy diagram for the two-spin nuclear system, a and b are spin indices, 
J = Jab is the unique coupling constant. The transitions labeled ciiow> ^highi biow 

and 

bhigh give rise to 4 spectral lines, and represent transitions (|0) |1))|0), (|0) |1))|1), 
|0)(|0) ^ |1)) and |1)(|0) ^ |1)) respectively. 



6.2.2 Universal set of quantum logic gates 

We consider the universal set of any coupled two-qubit operation together with the set of 
all single qubit transformations ||4^, ^ (also Section |2.2| ). Both requirements are 

satisfied in NMR as follows. 

Single qubit operations 

Single qubit operations can be induced by pulsed radio frequency (RF) magnetic fields, 
oriented in the xy-plane perpendicular to the static field. An RF pulse can selectively 
address spin i by oscillating at angular frequency w,. An RF pulse along the axis 17 induces 
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the rotation operator e~*2'^'^, where 9 is proportional to the pulse duration and amplitude. 
The rotation operator transforms the Bloch vector given by Eq. ( p.29| ) according to the 
right-hand-rule . 

For example, the Pauli matrix X = ie~^^^ can be performed in a single vr pulse about 
X. The Hadamard gate H = ie'^^""^ e'''^'^^ can be implemented by a 7r/2 pulse about y 
followed by a vr pulse about x. This is an example of a pulse sequence. (Both Z(^) or 
the full notation cjj*^ may be used in this part of the Dissertation.) 

Coupled operations 

Coupled logic gates can be naturally performed by the time evolution of the system. How- 
ever, the Hamiltonian in Eq. ( |6.3| ) does not couple specific pairs of qubits. Rather, all 
couplings occur simultaneously along with the intended one. The fundamental tasks of 
turning on and off specific coupling terms are called "recoupling" and "decoupling". In 
Chapter ^, we show how to recouple and decouple efficiently. For now, we simply assume 
that individual coupling term can be selectively turned on. To perform a coupled opera- 
tion between spins i and j, we turn on the coupling term QijZ^'^^ (8) Z^^^ for time t, leading 
to the evolution or logic gate e~*^*J*^'''''^-^*^' . Entanglement can be created because the 
evolution depends on the state of both spins. A frequently used coupled "operation" is 



e 4 



, which corresponds to an evolution time of t 



2J. 



J-. Appending 



with the single qubit rotations e* 4 * and e* 4 ^ leads to the unitary operation 



Xij 



Jtt/4: 



10 
10 
10 
-1 



(6.4) 



which is the controlled-z operation acting on spins i and j. Together with the set of 
all single qubit transformations, the Xij's complete the requirement for universality. For 
instance, CNOTjj = H^^^ Xij 

can be implemented by concatenating the sequences for 



each constituent operation. It is crucial in decoupling and recoupling that the free unitary 
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evolution can be reversed. 



6.2.3 Measurement 

The measured quantity in NMR experiments is the time varying voltage induced in a pick-up 
coil in the xy-plane: 



V{t) = -VoTr 



(6.5) 



The signal V{t), known as the free induction decay (FID), is recorded with a phase-sensitive 
detector. In Eq.( |6.5D , the onset of acquisition of the FID is taken to be t = 0. 

In the following, we consider a two-qubit system, with spin labels a and b and Jab = J- 
The generalization to n qubits is straightforward. Suppose the density matrix p(0) has 
Pauli decomposition p(0) = J2i,j=o'^ij^i (recall ctq, 1,2,3 = I,o'x,y,z)- One can calculate 
V{t) as a function of t and the coefficients Cij using Eq. ( |6.5| ). Furthermore, if one takes the 
Fourier transform of V{t), one obtains in the spectrum four peaks at frequencies ^ + ^, 
^ ~ ii ^ ~^ ^' ^"4' '^ith corresponding integrated areas ("peak integrals") 



^high 



^lou 



h 



high 



h,. 



Hcio - C13) + C20 - C23 

«(cio + C13) + C20 + C23 

i{coi - C31) C02 - C32 
iicQi + C31) C02 + C32 



(6.6) 
(6.7) 
(6.8) 
(6.9) 



Note that the expression cio — C13 occurring in the high frequency line of spin a is the 
coefficient of ax ® |1)(1| in p(0); cio +C13 in the low frequency line of spin a is the coefficient 
of cr^: (8) 1 0)(0 1 . Likewise, C20 — C23 is the coefficient of (Ty(Xi|l)(l| andc2o + C23 is the coefficient 
of (Ty |0)(0|. These quantities signify the transitions |0) |1) for spin a conditioned on 
spin b being in |1) or |0). Similar observations hold for the high and low frequency lines of 



spin b (see Fig. p.l| ). 
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Quantum state tomography 

From Eqs. (|6l^)-(|6^), we can determine the coefficients cio, C13, C20, C23, cqi, C31, C02, C32. It 
is also possible to determine other aj in the following manner. 

Suppose a 7r/2 x-pulse is applied to the first spin right before acquisition, inducing the 
transformation p{0) = J2ij CijUi^aj — > p'(0) = X^ij c^^ai^aj. It is easily checked that = 
C2j and = —c^j- Therefore, the acquisition of p'{G) yields c/^^q, c'13, c'20, C23, Cq^ 1 f^;^ , c'02 , C32 
or cio,ci3,-C3o,-C33,coi,C2i,co2,C22- Note that 030,033,021,022 add to what can be deter- 
mined in Eqs. ( |6.6[ )-(|6^) without the extra pulse before acquisition. Applying other readout 
pulses before acquisition allows different sets of readout coefficients to be obtained. 

To perform quantum state tomography - that is, to determine p{Q) - one can prepare 
/9(0) nine times and apply no pulses or 7r/2 pulses about the x or y directions on the two 
spins independently to find all Cij 7^ (0,0). 

The method for state tomography can be generalized to n qubits provided the coupling 
between any two spins is non-negligible. In general the resources required are exponential 
in the number of qubits, since the number of parameters to be determined in the density 
matrix is exponential in n. Q 

6.2.4 Thermal Initial States 

In bulk NMR quantum computation at room temperature, a pure initial state is not avail- 
able due to large thermal fluctuations {fiuJi <C ksT the thermal energy). Instead, a con- 
venient class of initial states are the states at thermal equilibrium (thermal states). The 
thermal state density matrix pth is diagonal in the energy eigenbasis, with diagonal entries 
proportional to the Boltzmann factors: 

Pth = ^e'^ (6.10) 

^ For a system of n qubits, there are n2"~^ spectral lines if all pairwise couplings are non-negligible. 
However, there are 2^" — 1 free parameters in the density matrix to be determined. 
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where Z is the partition function normaHzation factor. At room temperature, {^^) ~ 10 ^, 
Z K, Dim(pth) = 2" and pth = ^{I — fc^) to fi'^st order. Moreover, since (He) ^ (^z). 



Pth 



1 

2" 



kBTj 



(6.11) 



6.2.5 Example 



As an example of the above theories, consider applying the pulse sequence in Fig. 6.2 to 
the thermal state of the 2-spin system considered in previous sections. 



CN 



Rya ( 2 ) ^ -R'xa ( 9 ) 



Figure 6.2: Pulse sequence for CN. Time runs from left to right. Rij{9) stands for a rotation 
of angle 6 about the i-axis for spin j. r stands for a time evolution of duration 27 which 
results in the operation !Sab- 

In a heteronuclear system, the pulses are short compared to other relevant time scales. 
Therefore, other changes of the system during the pulses are ignored. The unitary operation 
implemented by the above sequence is given by (a, h are the first and second qubits) 



CN 







if o-^® 








1 - i 











1 











l-i 










l + i 










1 - 


i 






similar to CNOT;,a up to phase factors on each computational basis state. 



(6.12) 



(6.13) 



The thermal initial state is given by Eq. ( 6.11 ). Since the identity component is invariant 
under unitary transformations and gives no signal, we only consider the term proportional 
to —TCz, and omit the constant of proportionality: 



UJ„ 



Pth 



CTz 



2 



1 



Diag(Wa + UJb, UJa - Wfe, -UJa + Wfe, 



-UJa 



(6.14) 
(6.15) 
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The sequence transforms pth to 



— az az + — I ^ az 



^^b, -t^a +^^b, -^b) 



(6.16) 
(6.17) 
(6.18) 



in which the populations of |01) and |11) are interchanged. Note that CN and CNOTfta effect 
the same evolution on the thermal state, because the extra phases in CN are irrelevant for 
a state diagonal in the computational basis. This fact can be used to simplify the pulse 
sequences for the initialization procedures to be discussed in Section |6.5| . 

By inspection of Eqs. ( |6.14| ) and ( |6.18| ), it can be seen that pth and pcn have zero peak 
integrals given by Eqs. (|6.6D - (|6.9D . To obtain information about the states, a readout pulse 
-Rxa(f ) can be applied to transform the two states to 



Pth 



Pen 



2 ^ 
fJ,, I 

2 ^ 



^b -r 

/ + y I CT^ 

^b r 



(6.19) 
(6.20) 



In p'^^ is a term ay ® I with coefficient C20 = which contributes to two spectral lines 
at 1^ lb ^ with equal and positive, real peak integrals. The readout pulse transforms 
the unobservable coefficient C30 in pth to the observable —c^q in p^^, yielding information 
on the state before the readout pulse. Similarly, has a cjy ® az term with coefficient 
C23 = — ^ which gives rise to two spectral lines with real and opposite peak integrals 
(Fig. |6.3| ). Outputs in NMR experiments are peak integrals of this type carrying information 
on the states. 



i^a I J 

27r 2 



27r 2 



2n 2 



l^g J 

2tt 2 



Figure 6.3: The spectra of spin a after a readout pulse (a) on the thermal state pth and (b) 
on Pen- 
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6.3 Deviation density matrix and effective evolutions 



The thermal initial state in Eq. ( |6.11 ) decomposes into the identity term ^ and a small 

1 7-/ 

traceless term — 2" fciT called the deviation density matrix. As the identity does not con- 
tribute to any signal, it is the induced effective evolution on the deviation which is observed 
experimentally, rather than the original quantum operation. In this section, we discuss the 
relation between the observed induced evolution and the original quantum operation, due 
to Nielsen [|86[| . 

The problem to infer the original quantum operation from the observed effective evolu- 



tion in NMR will be addressed in Section 6.4, as original contribution from this Disserta- 



tion |27|. We apply the theory in effective evolution to analyze quantum error correction in 
NMR in Chapter |. 

As it turns out, the distinction between the original quantum operation and the observed 
induced evolution can be neglected in most applications. The identity does not contribute 
to any signal, and is invariant under the large important class of unital processes (see Sec- 
tion ^^ ), in which case the identity can be neglected |2^. In NMR, the discrepancy from 



unitality is caused by thermalization (generalized amplitude damping defined by Eq. ( 2.39 ) 
in Section |2.4| ). In most applications such as quantum algorithms, the relevant time scales 
are much shorter than the thermalization time scale, and unitality is a good approximation. 
Such operation regime will be considered in Sections |6.5| - |6.7| . 

6.3.1 Effective evolution 

For a non- unital but trace-preserving process the observable evolution of the deviation 
can be understood as follows. Rewriting 

p = + PA , (6.21) 

where pj\ = p — vl is the traceless deviation from the identity and v = l/Dim(/9), 

£{p) = v£{I)+£{pa) (6.22) 
= vI + vi£iI)-I) + £ipA) (6.23) 
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= vI + £a{pa) (6.24) 

where 

£A{pA) = vi£iI)-I)+£{pA), (6.25) 

and PA — > £a[pa) describes the observed evolution of the deviation. = iff is unital. 
This is because £a{^) = "^^{^{1) ~ I) ^ cannot be hnear for any non-unital £, and cannot 
be equal to £. 



6.4 Quantum process tomography in NMR 



Recall from Section 2.3.2 that quantum process tomography (QPT) is an experimental 



procedure to determine the unknown quantum process of an open quantum system. We 



consider using Method I for QPT described in Section 2.3.2 to test quantum gates and 
understand decoherence in NMR. The procedure relies upon the ability to prepare a com- 
plete set of basis states pin as input to the unknown process £, and the ability to measure 
the output density matrices £{pin)- In NMR, there are two practical problems associated 
with tomography. First, unitary operations only manipulate the deviation, preventing a 
complete set of basis states from being prepared by unitary actions alone. Second, one can 
only measure traceless observables, and thus one cannot obtain the entire output density 
matrix of a process. 

In this section, we show how these hurdles can be overcome theoretically. We first review 
the original QPT recipe of interest. Then we explain our extensions to the basic procedure 
which enable QPT with NMR. Experiments using QPT to investigate the fidelity of the 
CNOT gate and the validity of the independent error model in NMR are underway |27]. 

6.4.1 Review of QPT 

Let the unknown quantum process be given by the operator-sum representation £{p) = 
J2k ^kpAf.. In this context, £ can always be taken as trace-preserving. £{p) can be re- 
expressed as £{p) = J2mnXmnBmpBli, where i?m are a fixed basis for operators on p, and 
Xmn are entries of a positive Hermitian matrix. In the new representation, the information 
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about the process is represented by the coefficients Xmn instead of the operation elements 

Let the A^^ matrices pj be a basis for density matrices. The result of applying £ to pj 
can in turns be expressed in terms of the basis: 

k 

The coefficients Aj^, which fully specify f , can be determined experimentally using quantum 
state tomography (see Sections 2.3.2| and 6.2.3| ). To determine x from A, let BmPjB\ = 



Ylkl^Jk^Pk such that J2mnPjk"'Xmn = ^jk- Taking (3 as a matrix and X,x as vectors, with 
composite column and row indices mn and jk, we have (3x = A, which can be inverted to 
obtain x- 

6.4.2 QPT in NMR 

Consider Eq. ( |6.26 ) again: 



£{Pj) = Y.^jkPk. (6.27) 

k 

The original goal is to prepare a complete set of basis states pj and to measure £{pj) 
to obtain full information of Xjk- We describe the complications in NMR and possible 
resolutions, ffist in an abstract setting, followed by concrete methods in NMR. 



Equation ( 6.27 ) can be thought of as a linear representation of £, in which pj form 
a basis and Aj^ are entries of a matrix. In real life, one prepares independent pj and 
obtains £{pj) where pj and £{pj) are physical states. However, in this abstract setting, we 
change basis to an orthogonal one, with respect to the inner product {A,B) = TriA'fB). 
Expressing p and £{p) as vectors, £{p) = Xp. We choose pi = I. By the orthogonality of pj, 
Tr(pj) = V j > 2. Since £ is trace-preserving, Tr{£{pj)) = V j > 2, and have vanishing 
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pi component. Therefore, the matrix representation of A has the form 



1 











A 



(6.28) 



R 



M 



Experimentahy, only the traceless components of the output can be measured, with an 
amplification factor depending on many experimental details such as the amphfier gains, 
the RF coils, and the sample size. In other words, at most dR, dM are obtainable, where 
d is the unknown amplification factor. Moreover, if we can only manipulate the traceless 
deviation of the physical input unitarily, we only obtain d{M + R') where R' is a square 
matrix with every column equal to R/2"'. The first problem can be resolved by calibrating 
our apparatus with some known process to obtain d. The second problem can be resolved 
by perform an extra non-unitary procedure to prepare the input ^ to obtain dR. How 
these adaptions translate to the NMR case is described next. 

In NMR, the thermal initial state for a system of n spins is given by 



to prepare 2" — 1 linearly independent inputs. We augment the basis set with the identity 
matrix, prepared by applying a uniformly spatially varying RF pulse to the sample. The 
pulse rotates different parts of the sample by different amounts, so that effectively, the pulse 
acts as random X errors which depolarize the thermal state to the maximally mixed state. 
One can also calibrate the apparatus using the known initial state of the system resulting 
from thermalization. By state tomography, we can measure the traceless part of the thermal 
state given by Eq. (|]2|), and obt aiii ^ 2kBT '-^j-^''"'''- Comparing to the known deviation 
2^ 2FT can be determined. 




(6.29) 



By performing unitary operations on the system, we may rotate the ^jUJjZ^^^ deviation 
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6.5 Effective Pure States and State Labeling 

In this section, we outline one possible method to compute on the highly mixed thermal 
input states. We consider states which decompose into an "active" computing component 
and a "quiet" remainder. For example, when the density matrix decomposes into an iden- 
tity and a pure-state-like deviation, only the latter evolves and contributes to the output 
signal under unital processes, precisely simulating the computation on a pure state. The 
possibility to perform quantum information processing task directly on the mixed state 



input is discussed in Section 6.7. 



6.5.1 Effective Pure States 

An effective pure state is a state that behaves for all computational purposes as a pure 
state. A computation (possibly with initial state preparation and measurement procedures) 
is generally a trace-preserving quantum operation C. The density matrix is an effective 
pure state for a computation C corresponding to a pure state if; ^or all meaningful 

observables Oj, C induces another computation C and observable such that 

Tv{C'{pM = «IV(C(|V)(VI)0.) (6.30) 



for some fixed known constant a. In other words, proportional outcomes are obtained 
whether C is run on 1-0) or C is run on for all meaningful measurements. 

Physically, the standard measurement is to project onto the |0) or |1) states of the 
measured qubits, therefore Z is the measured operator. In NMR, X and Y are the measured 
operators. Therefore, we restrict to traceless observables Oj. 

For example, the following state is an effective pure state for any computation C which 
is unital: 

p, = i^/ + a|V)(Vl (6.31) 

This is because 

1 - a 

C(p,) = — ^/ + aC(|V)(Vl) and (6.32) 
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Tr [ C{p,)Oi \=aTr[ C{\^p){^p\)Oi \ (6.33) 

This effective pure state requires no modification of the original computation (C = C). 

Our eventual goal is to find modifications to the computation so that the input of interest 
(such as the thermal state) is an effective pure state. This can be achieved, for example, 
by finding a preparation procedure V which transforms the initial state to the form in 
Eq. ( |Ol| ). Alternatively, we may also consider other effective pure states which require 
non-trivial readout procedures TZ and other modifications to C. We devote the rest of this 
section to the preparation of effective pure states. 

6.5.2 State Labeling 

We consider preparing an effective pure state from an arbitrary initial state p. There are 
three major techniques: logical labeling, temporal labeling and spatial labeling. Special- 
ization to thermal initial states in NMR will be discussed. A hybrid method, as original 
work in this Dissertation (unpublished), will be described. Such method was independently 



reported in [71|. 



Logical Labeling 

In this method, extra qubits are used to label the subspaces of the input states. One 
prepends the computation C with an initial preparation step V and appends C with a 
readout step TZ, such that 

C'{p) = n o {li^kei ® C) o V{p) . (6.34) 

The purpose of V is to modify p by concentrating some of its randomness into the label 
states. In particular, we may choose V such that 



k 



(6.35) 
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where po = a\ip){Tp\ is the desired initial state and pk for /c > 1 are some "garbage" states. 
The \k) are labeling degrees of freedom. The computation Iiabei tX" C is to operate only on 
the Hilbert space of pk, leaving the label states invariant. We have 

p' = {^iabei®C)oV{p) = \k){k\ ®C{pk) (6.36) 

k 

For a measurement of Oi on the computation degrees of freedom, the readout preparation 
IZ is chosen as 

n{p') = ]^[p' + U,p'ul] (6.37) 
where Ui = |0)(0| ® I + X]fc>i ® Ai anticommutes with Oj. Then, 

Tr[C'(p)(/(^0,)] - Tr [ 7^(p') X (/ CO ] (6.38) 

= Tr[p'x ]^[I®0^ + \Q){Q\®0^ + Y,\k){k\®{A\O^A,)] ] (6.39) 

fc>i 

= Tr[p'x (|0)(0|®O,)] (6.40) 

= Tr [ C(po) 0^ ] (6.41) 

where Eq. ( |6.39| ) is obtained from Eq. ( 6.37 ) and cyclic permutation of the operators inside 



the trace. It can be viewed as applying the dual of TZ to the observable, an extension of the 
Heisenberg's picture to non-unitary quantum operations. 

Note that in logical labeling, one only needs to prepare a state of the form given in 
Eq. (6.31) in the subspace labeled by |0)(0|. This comes at a price - extra qubits and special 



readout procedures may be required. 

For NMR, the initial state p is the thermal state. When extra spins are used as the 
labeling degrees of freedom, the operation TZ may be omitted because the spectral lines are 
separated for different \k) in the labeling states (see Section 6.2.3| ). Alternatively, TZ can be 



implemented in two separate runs of the experiment, each with the identity or Ui in place of 
TZ. The measurement outcomes are then added to obtain the desired result. This method 
to add up the outcomes to simulate a quantum operation is closely related to temporal 
labeling as described next. 
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Temporal labeling 

This method requires no extra qubits nor special readout procedure, but it requires a number 
of repetitions of the experiment. The idea is to add up the results of a series of experiments 
that begin with different preparation operations Pk before the same intended computation 
C. By linearity, 



Y,TT[o,C{PkPthPl)] = Tt[o,C(j2PkPthPl 



Tr 



(6.42) 
(6.43) 



In other words, summing over the experimental results (on the left side of Eq. ( 6.43| )) is 
equivalent to performing the experiment with the unnormalized initial state pe = J2k PkPthPl 
(on the right side). The aim is to choose Pk such that p^ is the effective state in Eq. ( |6.31 ). 

In the case of diagonal initial states, such as the thermal state, P^ can be chosen to be 
cyclic permutations of the states |Z) for ^ > 1, which fix |0). For example. 
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(6.45) 



Then, 



EP'^PPI=E 



ai 
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(6.46) 
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which is an effective pure state of the form of Eq. ( |6.31 ). Such permutation works well for a 



small number of qubits, but requires resources exponential in the number of qubits. In |71| 
more efficient methods are presented. 



Spatial labeling 



Effective pure states have been created by Cory et al [42| by applying different unitary 
operations Pk which vary continuously with a spatial degree of freedom k. This is experi- 
mentally implemented using a static gradient magnetic field before the computation, which 
rotates each spin by an amount determined by the location of the molecule in the physi- 
cal apparatus such that the integral over k is an effective pure state. No extra qubits or 
repetitions are required. 



Hybrid method 

This method is a special procedure for thermal initial states, using both logical and temporal 
labeling. For a system of n qubits, it creates an effective pure state of n — 1 qubits using 2 
temporal labeling experiments. The number of gates required for the preparation is linear 
in n. 

Recall that pth = c{I + "yJ2i^i^^^^)i where c = and 7 = 2i^- "^^^ preparation 
procedure is defined by 

(6.47) 



V{p) = ^U2{p + UipUl)ul 
where Ui = 0^=2 CNOTi^, U2 applies X to the first qubit if all other qubits are in |1): 
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Intuitively, V works as follows. The density matrix of the thermal state looks like: 



-Pth 
c 



I 








/ 



+ 7 



UJlI 








—toil 



+ 7 



M 








M 



(6.48) 



where 2"' x 2"' block matrices are shown and 



M 



id2 +UJ3-\ \-UJn • • • 

U}2+UJ3-\ i^n • • • 

—U>2 — <^3 — \-<^n 

-LO2-UJ3 UJn 



(6.49) 



is the restriction of X^j<2 

Z(^) to the space of the last n — 1 spins. When the first spin is 
in state |1), Ui applies n?=2-^^*^ which negates M by conjugation. Therefore, 



1 



-UipthUi 
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/ 
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(6.50) 



(6.51) 



The final step U2 exchanges the last diagonal elements of the upper and lower diagonal 
block matrices. 
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(6.52) 
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where 





M9= 0-0 (6.53) 

• • 

• 2u;i — U!2 — ■ ■ ■ — UJn 

represents a pure-state deviation in the last n — 1 spins. Therefore, conditioned on the first 
spin being |1), we obtain an effective pure state for the other n — 1 spins. 
Required Resources For the preparation V, Ui takes n — 1 elementary gates. For U2, 
recall that extra phases are irrelevant on diagonal density matrices. Therefore, we can 
replace the conditional X^^^ by a conditional iX^^^ £ SU(2). The resulting operation takes 
0{n) elementary gates [|l^]. For the readout operation, provided each spin couples to the 
first spin, the spectral lines conditioned on the first spin being |0) or |1) are split. No 
extra readout procedure is required and we need only two temporal labeling experiments. 
Otherwise, four temporal labeling experiments are required. We obtain an effective pure 
state of n — 1 spins out of n spins. 

The hybrid method illustrates that labeling can be performed using very simple tech- 
niques. 



Remark 



State labeling is not just a way to create effective pure states; it is also a method to 
create robust quantum computation procedures. Equation ( |6.30| ) can be understood as 
transformation from a given quantum computation C (which nominally operates on pure 
state inputs) into another one, C , which is robust in the sense that it can operate on a class 
of mixed state inputs. This notion of robust quantum computation significantly expands 
the physical systems suitable for quantum computation. This idea is also related to the 



algorithms described in Section |6.7| which are robust against errors in the initial states. 
The fact that logical labeling can be viewed as an error detection procedure is discussion 
in Section B.6|. 



6.6. SIGNAL STRENGTH AND SGALING 



137 



6.6 Signal Strength and Scaling 

Even though all effective pure states are mathematically equivalent, physically, the signal 
has to be above the detection noise (in other words, a in Eq. ( |6.3[l| ) has to be large enough). 
We discuss the limitations of the signal strength when using the labeling techniques dis- 
cussed in the previous section. 

For a sample of N molecules each with n spins in a static field B, the transverse mag- 
netization of the i-th spin is given by 

M,i = N-finTvipX^"^) , (6.54) 

where 7, is the gyromagnetic ratio of the i-th spin. f\ Tr(/jX(*)) is upper bounded by the 
largest eigenvalue of the deviation density matrix of p. The labeling methods described so 
far are unital processes, which cannot increase the largest eigenvalue |87|. Therefore, the 
upper bound can be rewritten as 

where uji = 7^6 and 7 is the mean gyromagnetic ratio of the n spins. For protons at room 
temperature in an 11.8 Tesla field, ^hB/ksT ~ 10"^. 

Usually, NMR signals are detected inductively in some pick-up coil with K turns and 
area A, in a resonant tank with a quality factor Q. The time-varying magnetization leads 
to a changing flux <!> in the coil which produces a peak-to-peak voltage 

d$ d 

V = QK— = QK-uoM,,A . (6.56) 
at at 

In the lab frame the readout magnetization oscillates at the Larmor frequency jiB, therefore 
the amplitude of the oscillating voltage in the pick-up coil will be 



A. (6.57) 



^ The energy difference of the spin states is given by fiu = Ti-yYi = where 7 and are the gyromagnetic 
ratio and magnetic dipole moment of the spin. 
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Major improvements can be brought by using electron spins to cool down and read out 
nuclear spins. The electron gyromagnetic ratio is ~ 10^ times larger than those of the nuclei. 
Transferring the electron thermal polarization to the nuclear spins can increase the initial 
polarization by a factor of 10^. Transferring the output from the nuclear spins to electron 
spins gives another factor of 10^ improvement. For quantum computation, the bandwidth 
of interest is known a-priori and high-Q resonators can be used instead. The quality factor 
Q can be made at least 10^ times better. Since 2^*^ ~ 10^, each three orders of magnitude 
increase in the signal can accommodate about 10 more qubits. Therefore, 30 — 40 qubits are 
possible without large scale modifications of the experimental set up. Further scaling up 
will necessitate high spin polarizations, and may require optical pumping, phase transitions 
in ordered systems or cooling down to millikelvin temperatures. More sophisticated initial 



state preparation technique using algorithmic cooling [100| has been developed which yields 



0{n) spins in the ground state starting with molecules of n spins. 

6.7 Hot qubit algorithms 

We have seen how the stringent limitations on the initial states can be circumvent using 
effective pure states. Likewise, problems in bulk measurements can be resolved by de- 
terminizing quantum algorithms such that computation results do not average away |53|. 
However, such methods come at a cost: either an exponential reduction in signal strength 



or a linear reduction [100| in the number of usable qubits or extra computation steps, with 
all other resources held constant. It is an intriguing open question how limited the bulk 
quantum computation model at high temperature is compared to the standard quantum 
computation model. In this section, we exhibit non-trivial overlap between the two compu- 
tation models: they are polynomially equivalent for a class of quantum algorithms including 



the well known Deutsch-Jozsa (DJ) algorithm |46|. These "hot qubit algorithms" tolerate 
independent bit flip errors in the input states and remain informative when ensemble mea- 
surements are used. Our result does not resolve the general question of equivalence between 
bulk computation at high temperature and standard quantum computation; rather, it pro- 
vides new insight into the possibility to trade for important simplifications at the price of a 
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restricted computation model while retaining polynomial equivalence to the standard com- 
putation model. The discussion is motivated by bulk NMR quantum computation, though 
it applies to any physical system with similar abstract descriptions. 

6.7.1 Computation models 

We first define our computation models: 

1. Standard quantum computation (SQC), using a single quantum system with pure 
initial state (taken to be lO)®") and projective measurements. 

2. Bulk quantum computation (BQC), using a large number of identical and independent 
single quantum systems, and ensemble measurements. Depending on the available 
initial states, the BQC model can be subdivided: 

(a) BQCp with pure initial state p^Q'='p = (|0)(0|)'^" (for example, see |Tll ). 

(b) BQCt with thermal initial state p^^^T ^ in which the i-th qubit is in a statistical 
mixture of |0) and |1) with probabilities pi and (1 — Pi), independent of other 
qubits. 

(c) BQC^; with effective pure state as input. The effective pure state with parameter 
a is given by p^QC-^ = q(|0)(0|)®" + ^(1 - a)I. 

In SQC, projective measurement of the i-th spin (onto |0) or |1)) is a measurement of Z^*). 
Note that Z\x) = (1 — 2x)\x) for x G {0,1}. In BQC, the corresponding output from the 
final state p is Ei = Tr(pZ(*)). 

6.7.2 Noise sources in BQC 
Algorithmic uncertainty 

An algorithm which outputs a superposition of eigenstates of the measured operator is in- 
trinsically random. Using a bulk sample, the measurement outcome has variance inversely 
proportional to the sample size. The algorithmic uncertainty can therefore be made neg- 
ligible using a sufficiently large sample (independence of problem size), a situation which 
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we assume for the rest of the discussion. Furthermore, we assume the maximum allowed 
sample size is used. We call a single run using such a sample a "single shot experiment" . 

Channel noise 

A realistic measurement apparatus has imperfections which can be modeled as some noisy 
channel. The situation is like sending the correct experimental outcomes from the system 
to the experimentalist through a noisy channel (the measuring apparatus). Therefore, the 
random noise due to the apparatus is called "channel noise" . 



System output 
(correct value) 



Device 
(noisy channel) 



Measured 
value 



Figure 6.4: Measurement errors caused by an imperfect device can be modeled as sending 
the correct result through a noisy channel. 

We model channel noise as a random variable N with zero mean and variance A^^y. 
Channel noise restricts the distinguishability of different outcomes. Suppose S is the correct 
outcome, then the measured outcome is S' = S + N, with mean S and variance A^^y. 
In most applications, S is digitized by identifying S with the closest value in a discrete 
(ordered) set {di,d2, ■ ■ ■}■ Usually, d = ^ min^ \di — sets the required accuracy level. 
The probability to infer di incorrectly from S' depends on the ratio d : Aqn- As this ratio 
drops below unity, the error probability becomes large. In this case, K (^f^)^ repetitions 
of the experiment should be run to achieve a certain probability of success. Acn is generally 
independent on the algorithm and the problem size n, while d can depend on both. For a 
fixed algorithm, if d = 1/ f{n) then K ~ /{n)^- 

Initial state noise 

One can unify the various BQC models by viewing BQC^ and BQCt as computation on 
BQCp prepended with noise processes 8^{p^^^^) = p^Q'^M for ft = T,E. 

Let the noiseless channel on a single qubit be denoted by I, and the bit flip channel with 
flip probability 1 — p be denoted by Xp{p) = pp+{l — p)XpX. Then, 6^ can be represented 
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as 



St = Xp,0---(g) Xp^ (6.58) 
£e = aX®" + (1 - a);ff " . (6.59) 

2 

The initial state noise propagates in an algorithm in the following manner. Let C represent 
a computation in BQCp, which outputs p^'^^^ to give the outcomes 

Ef = TiiMjp^^^P) , (6.60) 

where Mj is a measured operator on the j-th qubit. Then, running C in BQC^ is equivalent 
to running C o S^^^ in BQCp. Furthermore, if J-fj_ exists such that 

Co£^ = J^^oC, (6.61) 

then, the mixed state computation outputs the state 



Pf^^''=-^.(pf«^-) (6.62) 



In other words, mixed state computation is like pure state computation appended with J^^. 
C propagates £^ to J-^. Furthermore, the measurement outcomes from BQC^ are 

= Tr(M,pf«^M) = TT{M,T,{p^Q^n) = Tr(.Ft (M,)pf Q^-) . (6.63) 

Comparing with Eq. (|6.60| ), computations in BQC^ is equivalent to computations in BQCp 
with modified measured operators. 

6.7.3 Hot Qubit Algorithms 

In this section, we introduce the hot qubit algorithms (HQA) for which BQC^, BQCp and 
SQC are polynomially equivalent. We first define HQA, and provide an important example 



afterwards. Other examples can be found in |114|. 



If C is unitary, JT^ — C o £^ o C'' always exists and is unique. 
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Definition 

We consider ensemble measurements Ei{p) = Tr(pZ''*^) and unitary computations C{p) = 
U pU'^ . Let J-T be as defined in the previous section. That is, 

Co£t = TtoC. (6.64) 

Then, the quantum algorithm C is an HQA if it satisfies the following two conditions: 

1. BQCp is polynomially equivalent to SQC when implementing C. 

2. Ei{J^T{p)) = liEi{p) where 7^ is independent of the input size. 



From Eq. ( 6.62| ) and condition 2, we conclude that for HQA, BQCt and BQCp are equiva- 



lent. In other words, C propagates £t to Ft which only causes a constant signal reduction. 
Together with condition 1, BQCy and SQC are polynomially related in implementing HQA. 

Example: The Deutsch-Jozsa algorithm 

We now illustrate the meaning of hot qubit algorithms with an example, the Deutsch-Jozsa 
algorithm [H^, IsH]. The goal is to solve the Deutsch-Jozsa problem [35]: 



/ : {0, 1}" — > {0, 1} is a binary function on n bits which is either constant or 
balanced (balanced means that \{x : f{x) = 0}| = \{x : f{x) = 1}| ). The 
question is to find out whether / is constant or balanced, using "oracle calls" 
which output \x)\f{x) © w) upon the input of 

2"~^+l classical queries are required in the worst case to solve the problem deterministically. 
However, there is a single-query algorithm under the SQC model: 1. Starting from the state 
|0)®"|1), apply to 

prepare aquery 2. Call the oracle C/j, obtaining 

Ex k)(|0) - |1)) ^ Ex(-1)-^^^'^N)(|0) - 3. Apply to the state. These three 

steps can be represented as: 

|0)«-|1) i-^-^^^ _l_^|^)(|o)-|l)) (6.65) 
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^ ^l=^(-l)/W|a;)(|0)-|l)) (6.66) 

a; y 

where all summations are over {0, 1}". The final state of the input register (the first n 
qubits) is therefore 

I'^f'') = 4EE(-i)^^'^®'''ly) = E5(y)ly) , (6-68) 

y X y 

with 

5(2/) = ^E(-l/^'^®"'- (6-69) 

X 

If / is constant, 13(0)1 = 1 and if / is balanced, \g{0)\ = 0. Projecting the input register 
along the computational basis gives a definite value of y. The cases y = and y ^ 
correspond to the deterministic answers / being constant and balanced respectively. 

We now consider running the algorithm in BQCp. Recall that we measure Z^^ (note 
that (y) ^ {{Z^^^), ■ ■ ■ , {Z^'"^))). As g{y) is a complicated function of /, it is not obvious if 
the measurements are still informative. Surprisingly, an affirmative answer can be obtained 
by the following simple argument. For SQC, the output is |y = 0) for / constant and \y > 1) 
for / balanced. They are eigenvectors of the coarse grained operator Z^ = J2i ^^^^ with 
eigenvalues A = n and X < n — 2. More importantly, E{Zt) = J^i^i ^^nd one needs to 
distinguish between E{Zt) = n and E{Zt) < n — 2. The digitization therefore requires an 
accuracy level of cZ = 1. If the channel noise causes a standard deviation of Acn in each 
measured Ei, the measured E{Zt) has standard deviation -s/nAcN- When n is large and 
d : Acn ^ 1, C){n) repetitions are sufficient to amplify the probability of success. Thus, 
BQCp can solve the DJ problem with 0{n) queries. The Deutsch-Jozsa algorithm satisfies 
condition 1 for being an HQA. 

For condition 2, we first show that the noise does not propagate. In other words, 
£t ° C = C o £rp and St = J^t- We then show that S affects all measurements only by a 
factor independent of n. 
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First of all, the (n + l)-th qubit (or the work bit) is unaffected by the algorithm, and 
can be omitted from the discussion. We first assume that the work bit is pure and take 
into account a possibly mixed state at the conclusion. Let U f and U denote the restriction 
of Uf (the oracle) and U (the computation) to the input register: U\x) = (— 1)-^^^-* |x) and 
U = H^^UfH^'' following Eqs. (|6:65|)-(|6:67|). For each i, [X^'\U] = [X^^ , H^'^U f H®"^] = 
H®''[Z^\Uf]H®'' = 0. Recah that £t = (^iXp^ and Xp{p) =pp+{l- p)XpX. Therefore, 
each operation element of £t commutes with U ; thus, the operations £t and C commute. 
Hence, the signal from the i-th qubit is given by 

^bqCt ^ Tr{Z'^^£T{Po^^^)) (6.70) 

= Tr(4(^^*^)/of^^^) (6.71) 

= (2pi-l)Tr(Z«pf«^^) (6.72) 

= (2pi-l)sf^^^ (6.73) 

The second to last line holds because £t is an independent qubit process, and X,p[Z) = 
{2p — 1)Z. We conclude that the outputs of BQCy and BQCp are identical up to the 
scaling factors {2pi — 1) and BQCr and BQCp are equivalent. Condition 2 for being an 
HQA is satisfied by the Deutsch-Jozsa algorithm. 

For simplicity, suppose Pi = p Vi. When the ancilla starts in a thermal state, the 
probability is p for it to be |1) as in Eq. (|6.65| ) and 1 — p for it to be |0). In the latter, 
the oracle does nothing to the input register (|x) — > |x)) and it outputs the signal of 
a constant function. We need to distinguish between J2i — (2p — and J2i — 
(2p — l){p(n — 2) + (l—p)n) = (2p—l){n — 2p). The smaller accuracy threshold d = j»(2]5 — 1) 
will only increase the number of repetitions by a constant factor. 

Discussion on HQA 

We have established that BQCy and SQC are polynomially equivalent in solving the DJ 
problem; however, we have not been able to show that BQCt outperforms its classical coun- 
terpart. The reason is, BQCy and SQC are equivalent in a probabilistic setting, while SQC 
is exponentially better than classical computation only in the deterministic case. BQCr 
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loses its advantage over classical computation when small probability of error is allowed, 
since probabilistic classical computation can solve the DJ problem with only a constant 
number of queries. 

The difficulty in finding HQA versions of other important algorithms lies in having 



to satisfy both conditions 1 and 2 simultaneously. For example, Simon's algorithm [106|, 
unmodified, satisfies condition 2, so that BQCt and BQCp are polynomially equivalent. 
However, the ensemble measurement in BQCp erases all the useful information in the 
answer for most inputs. Therefore, BQCp fails to solve Simon's problem with the original 
algorithm. While Simon's algorithm can be modified to function in BQCp, the modified 
algorithm no longer satisfies condition 2. Thus, we have not been able to construct an HQA 
for Simon's problem. 

6.7.4 Discussion on bulk computation models 

To summarize, 

• BQCp is polynomially equivalent to SQC in running all the known quantum algo- 
rithms, which can be determinized with polynomial overhead. 

• BQC^; with large a can efficiently simulate BQCp. 

• BQCt can simulate BQC^; (with parameter a) by preparing effective pure states. 
However, a < ^ where (1 + e)/2 = maxipi and n is the problem size. Therefore, 
BQCr can efficiently simulate BQCp and SQC only for sufficiently large 

• For HQA, BQCt and SQC are polynomially equivalent. In contrast, transforming 
BQCt to BQC_E to implement HQA requires an exponential number of repetitions. 

In the rest of this section, we discuss the difference between BQCt and BQC^;. We can 
consider the distance or the fidelity (using for instance, the definitions in |8^) between 
pBQCT^ pBQCE and |0)®", 1/2". 

First of all, for p^Q'^T ^ the entropy nH{p) never approaches n and the distance between 
pBQCr and the "useless" state 1/2"' is greater than 2p — 1. In contrast, converting p^^^^ 
to p^Q'-'B increases the entropy from nH{p) to more than (1 — a)n where a "-i^ o. The 
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distance between p^Q'^E ^j^j //2"' is vanishing. A more interesting fact is that both p^Q^T 
and p^Q^E have vanishing overlap with |0)'^"', yet there is an exponential gap in their 
computation capacity. The implication is that, states other than |0)®" in p^Q'-^r must be 
contributing in an HQA, but they are only arranged to be "quiet" in p^Q^E _ 

Finahy, p^Q'-^T jg product state but p^^^E cannot be written as pi ® ■ ■ ■ ® Pn- This 
in turns means that £e cannot be written as a product channel, or else p^^^E _ ^^^^^q^®™^ 
will be a product state. Rather, £e represents a classically correlated noise and propagates 
to diminish the output signal. 

Even though it remains inconclusive what BQCt can accomplish, the lesson is that the 
poor scaling in BQC^; is due to the inefficient conversion of the thermal state to an effective 
pure state for a large number of spins. As the thermal state is the stationary state of a 
natural decoherence process, and is easily obtained, it is important to find efficient simu- 
lations of SQC using BQCt, perhaps via yet another computation model which tolerates 
more noise than BQCt but can be more efficiently simulated by BQCt- 



6.8 Summary and preview 

In this chapter, we have seen how various techniques have made it possible to perform quan- 
tum computation on bulk NMR systems with thermal initial states. We have discussed logic 
operations, measurements, state tomography and process tomography in NMR. We have 
seen how the problems associated with mixed state inputs can be resolved for small scale 
computations by "state labeling". The possibility to perform some quantum algorithms 
directly on a thermal state was also demonstrated. 

In the next chapter, we will describe an efficient method to selectively perform coupled 
logic operations in NMR. In Chapter |^ we will describe an experiment on quantum error 
correction. The following facts or notations will be frequently used for the next two chapters. 

• Hamiltonians of an NMR system: 
Zeeman energy: TLz = —\ 

J-coupling: Tic = Y.i<j 9ijZ^''^ Z^^'^ where gij = ■nJij/2 



SUMMARY AND PREVIEW 
Time evolution: r 

Primitive coupled operation: = e~'f 

Thermal initial state: pth = '^b^ (i — 
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Chapter 7 

Logic gates in NMR - 
and recoupling 



decoupling 



7.1 Introduction 

Quantum computation requires the ability to perform coupled logic operations, which can 
only originate from the natural couplings in the quantum systems involved. In most cir- 
cumstances, the natural interactions do not couple specific pairs of qubits as desired in most 
applications of quantum computation. Rather, many couplings occur simultaneously along 
with the intended one. Moreover, the problem of simultaneous and undesired coupling gen- 
erally becomes worse with larger systems and stronger couplings, which are essential for 
quantum computation to be useful. The fundamental task to turn off spurious evolution is 
so difficult that, coercing a complex system to do nothing - ceasing all evolution - can 
be just as difficult as making it do something computationally useful. 

In this chapter, we address a simpler problem: to stop the spurious coupling and to 
perform specific coupled logic gates in NMR quantum computation. The task of turning 
off all couplings is known in the art of NMR as decoupling; doing this for all but a select 
subset of couplings is known as selective recoupling. The basic idea is to interrupt the free 
evolution by carefully chosen pulses. These pulses are single qubit operations that transform 
the Hamiltonian in the time between the pulses in such a manner that unwanted couplings 
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in consecutive evolutions cancel out each other. Ingenious schemes have been found [50, 107, 
|8^ but they do not address the problems relevant to quantum computation. The primary 
interest in these schemes is to reveal complex structures in the spectra rather than to achieve 
precise quantum evolutions. Quantum computation brings new requirements, and initial 



efforts |8C] have been made to develop pulse sequences to satisfy these needs; however, these 
schemes have necessitated resources (such as the total number of pulses applied) exponential 
in the number of spins being controlled. Schemes for selective recoupling are generally 
difficult to find for a large system. Each pulse simultaneously affects many coupling terms 
in the Hamiltonian. To turn off all but one of the coupling terms, these pulses have to 
satisfy many simultaneous requirements. 

In this chapter, we present efficient schemes for decoupling and selective recoupling. 
For an n-spin system, in which any pair of spins can be coupled, our schemes concatenate 
cn time intervals and use fewer than cn^ pulses, where c ~ 1 for most n with strict upper 
bound c < 2. Our method exploits simplifications in the couplings in heteronuclear spin 
systems. In this case, we show that the conditions for decoupling and selective recoupling 
are special orthogonality conditions, with solutions given by a class of well-known matrices 
called Hadamard matrices. These are generalizations of the well known Hadamard transfor- 
mation in quantum computation. The efficiency of the scheme originates from the existence 
of general Hadamard matrices in many dimensions. 



This chapter is structured as follows. Section 7.2 contains the precise statement of the 



problem. In Section \l.3[ we first motivate the construction of the decoupling scheme with 
examples, and then derive conditions for decoupling and describe the general construction 
related to Hadamard matrices. Important properties of Hadamard matrices are summarized. 
Modifications of the decoupling scheme to perform selective recoupling are described. We 



conclude with various properties and limitations of the scheme in Section 7.4. 
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7.2 The statement of the problem 



We shall consider a heteronuclear n-qubit system. Recall from Section 6.2.1 that the Hamil- 
tonian of such a system is given by 

H = Hz + Ti-c + Ti-env (7.1) 

where 

nz = -lj2^u;,Z^'\ (7.2) 

i 

denotes the Zeeman terms, and 

i<j 

denotes the coupling terms. 

A single qubit operation is performed by applying a pulsed radio frequency (RF) mag- 
netic field along some direction f/ perpendicular to the static field. To address the i-th 
spin, the frequency of the RF field is chosen to approximate uJijlTi. When the Wj's are very 
different, a very short pulse can be used, so that during the pulse, all other evolutions are 
negligible except for the rotation operator e~*2'^*''''' where d is proportional to the pulse 
duration and the power. The Lie group of all single qubit operations can be generated by 
rotations about x and y. 

Coupled operations such as controlled-z or cnot acting on the i-th and the j-th spins 
can be performed given the primitive 

7Z,, = e-^f . (7.4) 



The ultimate goal is to be able to efficiently realize arbitrary quantum operations on 
an n-spin system with arbitrary couplings. In this chapter, we consider a more limited 
objective, which can now be stated precisely, using the definitions of Eqs. (^)-(|7!J): 
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Given a heteronuclear system of n spins with free evolution e *('Wz+'Hc)t^ pQ]^_ 
trolled using typical RF pulses, how can be implemented efficiently? 

Following NMR tradition, we refer to this task as "recoupling" . 

7.3 Construction of the schemes 

A problem closely related to recoupling is the following: 

Given a heteronuclear system of n spins with free evolution e~*^'=*, controlled 
using typical RF pulses, how can the identity, /, be implemented efficiently? 

We refer to this task as "decoupling". It is conceptually easier to first construct a decou- 
pling scheme. The scheme is derived from Hadamard matrices, which will be reviewed. 
Modifications to implement selective recoupling will be described afterwards. 

7.3.1 Decoupling scheme for two qubits 

To motivate the general construction, we analyze the simplest example of decoupling two 



spins. From Eq. (7^), the evolution operator for an arbitrary duration t is given by r = 
^-igi2tz^^^0Z^^'> ^ Recall from Section [6.2.2 that X is a rotation of 9 = ir along x up to an 



irrelevant overall phase. X^^^ is physically performed by an RF pulse at frequency uJi. The 
important observation is, 

® = (7.5) 

and therefore 

= g-igi2iZ(i)®{X(2)z(2)x(2)) ^7^7) 

= g-igi2t2('>®(-Z(2)) i^jg^ 

= (7.9) 
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where Eq. ( [7.7D is obtained using a Taylor series expansion of the matrix exponent and 
the fact (X^^))^ = /. This observation implies that adding the gate X^'^^ before and after 
the evolution r results in t~^, so that the sequence of events X(2)rX(2)r = / has no net 
coupling although the spins are actually coupled all the time. This is called refocusing in 
NMR, and clearly illustrate how single qubit operations can transform the Hamiltonian so 
that unwanted couplings in consecutive evolutions cancel out each other. 

We now extract the essential features of the above decoupling scheme by rewriting the 
sequence r as 

and referring to r and X^'^^tX^'^^ as time intervals. We note the following facts: 

1. Since the matrix exponents commute, negating the coupling for exactly half of the total 
time is sufficient to cancel out the coupling. 

2. Since the coupling is bilinear in Z^^^ and Z^'^\ it is unchanged (negated) when the signs 
of and agree (disagree). 

3. The sign of Z^^^ is (— ) or (+) depending on whether X^^^ gates are applied before 
and after the interval. In other words, the sign of Z for each spin in each time interval is 
controlled by inserting X gates for that spin before and after that interval. 

In summary, the most crucial point leading to decoupling is that, the signs of the az 
matrices of the coupled spins, controlled by the X gates, disagree for half of the time. 

7.3.2 Sign matrix and decoupling criteria 

In general, we consider schemes which concatenate a certain number of equal-time intervals 
and use X gates to control the signs of Z for each spin. The essential information on the 
signs can be represented by a "sign matrix" defined as follows. The "sign matrix" of a pulse 
scheme for n-spins with m time intervals is the n x m matrix with the {i, a) entry being 
the sign of Z^*-* in the a-th time interval. We denote any sign matrix for n spins by Sn- For 
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example, the sequence in Eq. ( [7.10 ) can be represented by the sign matrix 



So 



+ + 
+ - 



(7.11) 



Each row represents a sequence of m intervals for each spin and each interval given by — is 
preceded and followed by X gates for that spin. Therefore, each sign matrix corresponds to a 



sequence of events for the whole system. Following the discussion in Sec. |7.3.1| , decoupling is 
achieved whenever any two rows in the sign matrix disagree in exactly half of the entries (all 
couplings are negated for exactly half of the time) . The general construction of decoupling 
scheme is now reduced to finding sign matrices satisfying the above criteria. 

As an illustration, we construct a decoupling scheme for four spins. We first find a 
correct sign matrix and then derive the corresponding pulse sequence. For example, a 
possible sign matrix is given by 



+ + 

+ + 

+ - 

+ - 



+ 



+ 



(7.12) 



in which any two rows disagree in exactly two entries. The sequence corresponding to 5*4 
can be obtained by converting each column to a time interval before and after which X 
pulses are applied to spins (rows) given by — 's. No pulses are applied to spins (rows) with 
+'s. The resulting sequence. 



(4)^ 



(7.13) 



is the identity by construction and this can also be verified directly. Note that Tic in 



-iHct 



now denotes the sum of six possible coupling terms for four spins. Note also 



Eq. ( 7.13| ) is written in such a way that it corresponds visually to the sign matrix, though 
the evolutions are actually in reverse time order relative to 6*4. However, such ordering is 
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irrelevant for commuting evolutions. Since can be simplified to 



(7.14) 



This simplified pulse sequence can also be obtained directly from Eq. ( [7.12 ) by converting 
columns to time intervals and inserting X^^^ between intervals whenever the i-th row changes 
sign or whenever a — sign reaches either end of the row. The relation between the sequences 
in Eq. ( |7.13|) and Eq. (|7.14D and 54 is illustrated in Fig. |7T . 



COl 

C02 

m 

004 



(a) 



COl 
032 
033 
034 



(b) 



time 
>- 



t t t t 



Figure 7.1: (a) Pulse sequence corresponding to Eq. ( [i".13| ). From S4, each "— " sign in the 
i-th row and a-th column translates to two X pulses at before and after the a-th time 
interval, (b) Pulse sequence obtained from simplifying (a). This corresponds to Eq. ( 7.14| ), 
and can be constructed directly from 5*4 by translating each change of sign in the i-th row 
to an X pulse at Wj. A "— " sign at the end of the row also gives rise to an X pulse at end 
of the last time interval. 



The above scheme can be generalized to decouple n spins with m time intervals as 
follows: 

Construct the n x m sign matrix Sn, with entries + or — , such that any two 
rows disagree in exactly half of the entries. For each — sign in the i-th row and 
the a-th column, apply X^^^ before and after the a-th time interval. 

Because of the pulses, in each time interval, each Z^*) has a sign as given by the sign matrix. 
The az matrices of any two spins therefore have opposite signs for half of the time, during 
which their coupling is negated, and the evolution is always canceled. 

For n spins, n x m sign matrices which correspond to decoupling schemes do not nec- 
essarily exist for arbitrary m, but they always exist for large and special values of m. A 
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possible structure is: 



+ 



+ + 



+ + 



+ + 



+ 



+ 



+ + 



+ - 



+ 



+ - 



- + 



+ - 



+ 



+ 



- + 



- + 



in which intervals are bifurcated when rows (spins) are added. Such bifurcation takes place 
whenever it is impossible to add an extra row that is orthogonal to all the existing ones 
("depletion"). If such depletion occurs frequently, the sign matrix will have exponential 
number of columns, and decoupling will take exponential number of steps as n increases. 
The challenge is to find correct sign matrices with subexponential number of columns. 

7.3.3 Equivalent decoupling criteria 

The criteria for a sign matrix Sn to represent a valid decoupling scheme is that any two 
rows disagree in exactly half of the entries. It is useful to rephrase the complicated criteria 
concisely. Suppose it is replaced by ±1 in 5„. If 5„ satisfies the decoupling criteria, any two 
rows have zero inner product and therefore SnS^ = nl. Conversely, any n x m matrix M 
with entries ±1 satisfying MM'^ = n/ is a valid sign matrix giving a decoupling scheme that 
requires m time intervals. We now present very efficient solutions to this simple criteria, 
namely the Hadamard matrices. 

7.3.4 Hadamard matrices 

Hadamard matrices have applications in many areas such as the construction of designs, 
error correcting codes and Hadamard transformations p8| , |l], |111| , |8^ . 

A Hadamard matrix of order n, denoted by H{n), is an n x n matrix with entries ±1, 
such that 



H{n)H{nf = nl . 



(7.15) 
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The rows are pairwise orthogonal, therefore any two rows agree in exactly half of the 
entries. Likewise columns are pairwise orthogonal. We abbreviate "±1" as "it". S2 and 5*4 
in Eqs. ( [7.11 ) and ( |7.12| ) are simple examples of H(2) and i?(4). An example of H(12) is 
given by 



iJ(12) = 



+ + 



+ + - 

- + + 

+ - + 

+ + - 

+ - + 

+ - - 

+ + - 



+ 



- + - 



+ + 
- + 



+ 



(7.16) 



The following is a list of useful facts about Hadamard matrices (details and proofs 
omitted): 

1. Equivalence Permutations or negations of rows or columns of Hadamard matrices 
leave the orthogonality condition invariant. Two Hadamard matrices are equivalent if 
one can be transformed to the other by a series of such operations. Each Hadamard 
matrix is equivalent to a normalized one, which has only +'s in the first row and 



column. For instance, H{12) in Eq. (7.16) can be normalized by negating the 7-th 
row and column. 



2. Necessary conditions H{n) exists only for n = 1, n = 2 or n = mod 4. This is 
obvious if the matrix is normalized, and the columns are permuted so that the first 



158 



CHAPTER 7. LOGIC GATES IN NMR - DECOUPLING AND RECOUPLING 



three rows become: 



+ ••• 


+ 


+ • 


• + 


+ • 


• + + • 


• + 


+ ••• 


+ 


+ • 


• + 








+ ••• 


+ 






+ • 


• + - • 





3. Hadamard's conjecture H{n) exists for every n = mod 4. This famous conjec- 
ture is verified for all n < 428. 

4. Sylvester's construction [|109(| If H{n) and H{m) exist, then H{nm) can be con- 
structed as H{n) H{m). In particular, H(2^') can be constructed as 77(2)®'', which 
is proportional to the matrix representation of the Hadamard transformation for r 
qubits. 

5. Paley's construction Let q be an odd prime power, li q = 2> mod 4, then H{q + 1) 
exists; \i q = 1 mod 4, then H{2{q + 1)) exists. 

6. Numerical facts For an arbitrary integer n, let n and n be the largest and 
smallest integers that satisfy n < n <n with known H{n) and H(ji). We define the 
"gap", 5n, to be n — n (see Fig. |7.2| ). For n < 1000, H{n) is known for every possible 
order except for 6 cases, and the maximum gap is 8. For n < 10000, H{n) is unknown 
for 192 possible orders and the maximum gap is 32. 

< 8n — >■ 

— I 1 1 ^ 

n n FT 

Figure 7.2: The gap 5n between two existing orders of Hadamard matrices. 

The importance of the full connection to Hadamard matrices will become clear after we 
construct the scheme for an arbitrary number of spins in the next section. 
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7.3.5 Hadamard matrices and decoupling scheme 



It is immediate from previous discussions that each H{n) is a vahd sign matrix giving a 
decouphng scheme for n spins using only n time intervals. 

For example, S2 and 5*4 in Eqs. ( [7.11 ) and ( 7.12 ) are H(2) and H{A). Whenever H{n) 
exists, there is a decoupling scheme for n spins concatenating only n time intervals. However, 
H{n) may or may not exist for a given n. For an arbitrary integer n, let n be the smallest 
integer that satisfies n <n with known Hin). To construct a decoupling scheme for n spins 
when H{n) does not necessarily exist, we start with H(ji) and take 5„ to be any n x n 
submatrix of Hin). In other words, Sn is formed by choosing n rows from H{n), which 
still achieves decoupling because subsets of rows of Hiji) are still pairwise orthogonal. The 
resulting decoupling scheme for n spins requires n time intervals. 



As an example, 6*9 can be chosen to be the first nine rows of H{12) in Eq. (7.16): 
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(7.17) 



Note that the scheme is efficient if n — n ^ n. A detailed analysis of the efficiency will be 
given after we present the recoupling scheme. 



7.3.6 Recoupling Scheme 

We first construct a scheme which removes both Tiz and Tic. To remove both 7iz and Tic, 
note that the Zeeman term for the i-th spin is linear in Z*^*) , and negating Z^*) for half of 
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the time results in no net Zeeman evolution for the i-ih spin. Therefore, Zeeman evolution 
for all spins can be removed if the sign matrix has identically zero row sum. Such a sign 
matrix can be constructed by starting with a normalized H{n) and excluding the first row 
of H{n) in the sign matrix. Since a normalized H(n) has only +'s in the first row, all other 
rows have zero row sums by orthogonality. Such construction is possible unless n = n, in 
which case construction should start with H{n + 1). For instance, the last nine rows of the 
normalized H{12) is a valid ^g. 

To implement selective rccoupling between the i-th and the j-th spins, the sign matrix 
should have equal i-th and j-th rows but any other two rows should be orthogonal. The 
coupling term QijZ'^^^ ® Z^^^ never changes sign and is implemented selectively, while all 
other couplings are removed. The sign matrix can be obtained from a normalized H{n) by 
first excluding the 1-st row and taking the 2-nd row of H{n) to be the i-th and the j-th rows 
of Sn- The other n — 2 rows of Sn can be chosen from the remaining n — 2 rows of H{n). 
This scheme also removes Hz and requires no more than fi time intervals. To implement 
^ij, the duration of each interval t is chosen to satisfy gijut = tt/4. Note that the total 
time used to implement is the shortest possible, since the coupling is always "on" . 

For example, starting from the normalized H{12), Sg performing can be chosen as 
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7.3.7 Efficiency 



The decoupling and recoupling schemes require n time intervals. They require at most nn 
pulses, since XX = I and the X pulses are only used in pairs. The remaining question is: 
how does n depend on n? If Hadamard matrices exist and can be constructed for all orders, 
n = n. However, some Hadamard matrices are missing, either because no construction 
methods are known or they simply cannot exist. Therefore, n = cn where c > 1. We will 



use the facts about the existence of Hadamard matrices described in Section 7.3.4. As H(n 



exists for every n = mod 4 for n < 428, therefore, n — n < 3 Vn < 428. We argue for 
arbitrary n that the schemes are still very efficient. First of all, we prove that c < 2. For each 
n, there exists r such that 2'^~^ < n < 2^ . Since H{2^) exists by Sylvester's construction, 
cn = n < 2'"< 2n. We now show that c is close to the ideal value 1 in most cases, due to 
the existence of Hadamard matrices of orders other than powers of 2. This is why the full 
connection to Hadamard matrices is useful. First of all, n — n < 31 Vn < 10000. In Fig. |7.3| , 
c as a function of n is plotted for n < 10000. Within this technologically relevant range of 
n, c deviates significantly from 1 only for a few exceptional values of n when n is small. For 
completeness, we present arguments for c ~ 1 for arbitrarily large n in Appendix p.l| . This 
is based on Paley's construction and the prime number theorem. Finally, if Hadamard's 
conjecture is proven, n — n < 3 Vn. 



o 



Figure 7.3: Plots of c vs n, where cn = n is the minimum number of time intervals required 
to perform decoupling or selective recoupling for an n-spin system, c for n < 100 and 
101 < n < 10000 are plotted separately. 
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7.4 Conclusion 

We reduce the problem of decoupling and selective recoupling in heteronuclear spin systems 
to finding sign matrices which is further reduced to finding Hadamard matrices. While the 
most difficult task of constructing Hadamard matrices is not discussed, solutions already 
exist in the literature. Even more important is that the connection to Hadamard matrices 
results in very efficient schemes. 

Some properties of the scheme are as follows. First of all, the scheme is optimal in the 
following sense. The rows of Hadamard matrices and their negations form the codewords of 



the first order Reed-Muller codes, which are perfect codes |111, 33 1. It follows that, for each 
Hadamard matrix, it is impossible to add an extra row which is orthogonal to all the existing 
ones. Therefore, for a given n, n is in fact the minimum number of time intervals necessary 
for decoupling or recoupling, if one restricts to the class of schemes considered. Second, 
the scheme applies for arbitrary duration of the time intervals. This is a consequence of 
the commutivity of all the terms in the Hamiltonian, which in turn comes from the large 
separations of the Zeeman frequencies compared to the coupling constants. Spin systems 
can be chosen to satisfy this condition. Finally, disjoint pairs of spins can be coupled in 
parallel. 

We outline possible simplifications of the scheme for systems with restricted range of 
coupling. For example, a linear spin system with n spins but only /c-nearest neighbor 
coupling can be decoupled by a scheme for k spins only. The i-ih row of the n x k sign 
matrix can be chosen to be the r-th row of H(k), where i = r mod k. Selective recoupling 
can be implemented using a decoupling scheme for k+1 spins. The sign matrix is constructed 



as in decoupling using H{k + 1) but the rows for the spins to be coupled are chosen to be the 
{k + l)-th row different from all existing rows Q. This method involving periodic boundary 
conditions generalizes to other spatial structures. The size of the scheme depends on k and 
the exact spatial structure but not on n. 

Our scheme has several limitations. First of all, it only applies to systems in which spins 
can be individually addressed by short pulses and couplings have the simplified form given 



by Eq. (7.3). These conditions are essential to the simplicity of the scheme. They can all be 
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satisfied if the Zeeman frequencies have large separations. Second, generalizations to include 
couplings of higher order than bilinear remain to be developed. Furthermore, in practice, 
RF pulses are inexact and have finite durations, leading to imperfect transformations and 
residual errors. 

The present discussion is only one example of a more general issue, that the naturally 
occurring Hamiltonian in a system does not directly give rise to convenient quantum logic 



gates or other computations such as simulation of quantum systems [110]. Efficient con- 
version of the given system Hamiltonian to a useful form is necessary and is an important 
challenge for future research. 
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Chapter 8 



Quantum error correction in NMR 



8.1 Introduction 

We have described how quantum error correction can be used to protect information. We 
have also discussed the theories that enable small scale quantum information processing 
in NMR. In this chapter, we describe an experimental implementation of a simple phase 
error detection scheme ||3^ that encodes one qubit in two and detects a single phase error 
in either one of the two qubits. The output state is rejected if an error is detected so that 
the probability to accept an erroneous state is reduced to the smaller probability of having 
multiple errors. Our aim is to study the effectiveness of quantum error correction in a 
real experimental system, focusing on effects arising from imperfections of the logic gates. 
Therefore, the experiment is designed to eliminate potential artificial origins of bias in the 
following ways. First, we compare output states stored with and without coding (the latter 
is unprotected but less affected by gate imperfections). Second, by ensuring that all qubits 
used in the code decohere at nearly the same rate, we eliminate apparent improvements 
brought by having an ancilla with lifetime much longer than the original unencoded qubit. 
Third, our experiment utilizes only naturally occurring error processes. Finally, the main 
error processes and their relative importance to the experiment are thoroughly studied and 
simulated to substantiate any conclusions. In these aspects, our study differs significantly 
from previous work |41] demonstrating quantum error correction working only in principle. 
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We performed two sets of experiments using NMR: (1) the "coding experiments" in 
which input states were encoded, stored and decoded, and (2) the "control experiments" 
in which encoding and decoding were omitted. Comparing the output states obtained from 
the coding and the control experiments, both error correction by coding and extra errors 
caused by the imperfect coding operations were taken into account when evaluating the 
actual advantage of coding. In our experiments, coding reduced the net error probabihties 
to second order as predicted, at the cost of smah additional errors which decreased with 
the original error probabilities. We identified the major imperfection in the logic gates 
to be the inhomogeneity in the radio frequency (RF) field used for single spin rotations. 
Simulation results including both phase damping and RF field inhomogeneity confirmed 
that the additional errors were mostly caused by RF field inhomogeneity. The causes and 
effects of other deviations from theory were also studied. 



The rest of this chapter is structured into five sections: Section 3.2 reviews the phase 



damping model, the two-bit coding scheme, and aspects of bulk NMR quantum computa- 



tion useful for the present discussion. These are reviewed in Sections p.2.1| , p.2.2| and p. 2. 3 . 
Section ^.3| describes the methods to implement the two-bit coding scheme in NMR, and 
the fidelity measures to evaluate the scheme. Section |8.4| presents the experimental details. 
Section |8.5| consists of the experimental results together with a thorough analysis. The 
effects of coding, gate imperfections and the causes and effects of other discrepancies are 



studied in detail. In Section 8.6, we conclude with a summary of our results. We also 
discuss syndrome measurement in bulk NMR, the equivalence between logical labeling and 
coding, the applicability of the two-bit detecting code as a correcting code exploiting clas- 
sical redundancy in the bulk sample and the issue of signal strength in error correction in 
bulk NMR. 

8.2 Theory 

8.2.1 Phase damping 



We first introduced phase damping in Section 2.4. We provide a more physically motivated 



explanation in the following. Phase damping can be caused by random phase shifts e ^^^^^/^ 



8.2. THEORY 



167 



of the system due to its interaction with the environment, causing the state change: 
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b* 




p = 






P = 




b 


c 





a b*e-'^ 
be'^ c 



We model phase randomization as a stochastic Markov process with 9 drawn from a 
normal distribution. The density matrix resulting from averaging over 9 is 



ip') 



1 



27rs 



e'^pd9 



be 



b*e 2 
c 



where is the variance of the distribution of 9. By the Markov assumption, the total 
phase shift during a time period t is a random walk process with variance proportional to 
t. Therefore, we replace by At in Eq. ( |8.2| ) when the time elapsed is t. The effect of 
phase damping is: 



a b* 
b c 



e"^*6 c 



(8.3) 



Since the diagonal and the off-diagonal elements represent the populations of the basis 
states and the quantum coherence between them, the exponential decay of the off-diagonal 
elements caused by phase damping signifies the loss of coherence without net change of 
quanta. 

Phase damping describes the axisymmetric exponential decay of the x and y components 



of any Bloch vector, p = ^{I + xax + ycTy + zctz), as depicted in Fig. 8.1 . 

Recall that phase damping can also be described as a discrete process. Eq. { f>.3\ j de- 
scribing phase damping can be rewritten as 



S{P) = (!-?') Ipl'' + P (^zpcrl, 



(8.4) 



where p= (l-e-^*)/2. In Eq. (|!|), the output £:(p) can be considered {l-p):p mixture 
of p and cr^puj; in other words, £{p) is a mixture of the states after the event "no jump" 
(/) or "a phase error" (fi^) has occurred. The weights 1 — p and p are the probabilities of 
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i |0) i |0) 




Figure 8.1: Trajectories of different points on the Bloch sphere under the effect of phase 
damping. Points move along perpendiculars to the z-axis at rates proportional to the 
distances to the z-axis. As a result, the Bloch sphere turns into an ellipsoid. 



the two possible events. 

We emphasize that Eqs. (|8.3|) and (^) describe the same physical process. Equa- 



tion (8^) provides a discrete interpretation of phase damping, with the continuously chang- 
ing parameter e~^* embedded in the probabilities of the possible events. 

For a system of multiple qubits, we assume independent decoherence on each qubit. For 
example, for two qubits a and b with error probabilities pa and pb, the joint process is given 

by 

£ip) = {1 - pa){l - Pb) (I <E) I) p (I <E) I) 

+ i'^-Pa)Pb [I ® (Tz) p {I ® (Jz) 

+ Pa (1 - Pb) {cFz ®I)P {Tz ® I) 

+ Pa Pb (C^z CTz) p {(Jz (Jz) , (8.5) 

where p denotes the 4x4 density matrix for the two qubits. The events Gz® I and I ® az 
are first order errors, while Gz ® cTz is second order. First and second order events occur 
with probabilities linear and quadratic in the small error probabilities. 

Having described the noise process, we now proceed to describe a coding scheme that 
will correct for it. 
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8.2.2 The two-bit phase damping detecting code 

For a code to detect errors, it suffices to choose the codeword space C such that all errors 
to be detected map C to its orthogonal complement. In this way, detection can be done 
unambiguously by a projection onto C without distinguishing individual codewords; hence 
without disturbing the encoded information. To make this concrete, consider the code 

1 



|0i) = _(|00) + |11)) 
= ^(|01) + |10)) 



(8.6) 
(8.7) 



An arbitrary encoded qubit is given by 



oIOl) +/5|1l 
1 

V2 



a(|00) + |ll))+/3(|01) + |10)) 



(8.8) 
(8.9) 



After the four possible errors in Eq. (|8.5D, the possible outcomes are 



\i^zi) 
\i>zz) 



i^i 



I 



^ |oo) + |ii) I ^ |oi) + |io) 



a 



a 



a 



|00) 



111) 



V2 

|oo)-|ii) 

|00) + |11) 



V2 



+ /3 
+ /3 



|oi)-|io) 
-|oi) + |io) 

-|oi)-|io) 

V2 



^.10) 

^.11) 

^.12) 
^.13) 



with the first order erroneous states \il)zi) and iV'/z) orthogonal to the correct state \'4)ii). 
Therefore, it is possible to distinguish ( |8.10| ) from ( 8.11 ) and ( 8.12 ) by a projective mea- 
surement during decoding, which is described next. 

The encoding and decoding can be performed as follows. We start with an arbitrary 
input state and a ground state ancilla, represented as qubits a and b in the circuit in Fig. |8.2| . 

To encode the input qubit, H is applied to the ancilla, followed by CNOTftQ. Let spins a 
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1 V. 








1 / 
H 7--7^ 



I'Pl) |'P2) |<P3) 



P4 



Figure 8.2: Circuit for encoding and decoding. Qubit a is the input qubit. I^i-s) are 
given by Eqs. ( ^.14D - (^.16|) . p4, are mixtures of the states in Eqs. ( ^.10|) -( 8ri^ ) and in 
Eqs. ( 8.1^ )-( [8.2lD . A phase error in either one of the qubits will be revealed by qubit b 
being in |1) after decoding, and in that case, qubit a will be rejected. 



and b be the first and second register. Then, the qubits transform according to Fig. as 



(a|0) + /?|l))|0) 

-^(a|0)+/3|l))(|0) + |l)) 
1 

1 

71 



(a|0)+/?|l))|0)+(a|l)+/3|0))|l) 
a(|00) + |ll))+/3(|01) + |10))l , 



^.14) 
^.15) 

^.16) 

^.17) 



where Eq. ( ^.17[) is the desired encoded state. 

The decoding operation is the inverse of the encoding operation (see Fig. so as 
to recover the input (a|0) + /3|1))|0) in the absence of errors. Phase errors lead to other 
decoded outputs. The possible decoded states are given by: 



(a|0)+/3|l))|0) 



\tpzz) 



(«|o)-/?|i))|i) 

(a|0)+/3|l))|l) 
(a|0)-/J|l))|0). 



(8.18) 
(8.19) 
(8.20) 
(8.21) 



Note that the ancilla decodes to |1) if and only if a single phase error has occurred. 
Moreover, qubits a and b are in product states but they are classically correlated. Therefore, 
the syndrome can be read out by a projective measurement on b without measuring the 
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encoded state. The decoding operation transforms the codeword space and its orthogonal 
complement to the subspaces spanned by |0) and |1) in qubit 6, while all the encoded 
information, either with or without errors, goes to qubit a. 

We quantify the error correcting effect of coding using the discrete interpretation of the 



the errors I®/, I<^az, (Tz^^I, and (Tz(^(Jz occur with probabilities (1— pa)(l— Pb), 0-—Pa)Pb-, 
Pa{^ — Pb) and paPb respectively, and only in the first and the last cases will the output state 
be accepted. The probability of accepting the output state is {1 — pa){l— Pb) +PaPb whereas 
the probability of accepting the correct state is (1 — pa)(l —Pb)- The conditional probability 
of a correct, accepted state is therefore 



for small pa, Pb- The code improves the conditional error probability to second order, as a 
result of screening out the first order erroneous states. 

We conclude this section with a discussion of some properties of the two-bit code. First, 
the code also applies to mixed input states since the code preserves all constituent pure 
states in the mixed input. Second, we show here that two qubits are the minimum required 
to encode one qubit and to detect any phase errors. Let C be the 2-dimensional codeword 
space and E he a, non-trivial error to be detected. For phase damping, E is unitary and 
therefore EC is also 2-dimensional. Moreover, C and EC must be orthogonal if E is to be 
detected. Therefore the minimum dimension of the system is 4, which requires two qubits. 
However, using only two qubits implies other intrinsic limitations. First, the code can detect 
but cannot distinguish errors. Therefore, it cannot correct errors. This affects the absolute 
fidelity (the overall probability of successful recovery) but not the conditional fidelity (the 
probability of successful recovery if the state is accepted). Second, the error cTz ® cannot 
be detected. This affects both fidelities but only in second order. To understand why these 
limitations are intrinsic, let {i?fc} be the set of non-trivial errors to be detected. Since E}^C 
has to be orthogonal to C for all and since C has a unique orthogonal complement of 
dimension 2 in a two-bit code, it follows that all E^C are equal, and it is impossible to 



noise process, leaving a full discussion of the fidelity to Section Recall from Eq. ( |8.5| ) that 



(l-p,)(l-pb) 



1 - PaPb 




(1 -Pa)(l -Pb) +PaPb 
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distinguish (and correct) the different errors. By the same token, for any distinct errors 
E^i and E'^, Ej^iE^C = C because they are both orthogonal to E/^iC, which has a unique 
2-dimensional orthogonal complement. Therefore, a two-bit code that detects single phase 
errors can never detect double errors. Finally, since a detecting code cannot correct errors, 
it can only improve the conditional fidelity of the accepted states but not the absolute 
fidelity. We remark that the conditional fidelity is a better measure in our experiments 
due to the bulk nature of the system used to implement the two-bit code. A discussion of 
fidelity measures in our experiments and quantum error correction in bulk systems will be 
postponed until Sections 3^ and The system in which the two-bit code is implemented 
will be described next. 



8.2.3 Bulk NMR Quantum Computation 

We use a two qubit NMR system, with reduced Hamiltonian (see also Fig. ^.3| 



ri = —CJz ®i- —I + — O-^ ®az+ Henv , 



(8.23) 



where the symbols have their usual meaning defined in Section 3.2 and gab = vrJ/2. 



ho): 



|b"high ^ 



101): 
loo): 
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(to +C0J/2 +nJ/2 
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'low 
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Figure 8.3: Energy diagram for the two-spin nuclear system. The transitions labeled a^o^, 
a-high, blow and bhigh refer to transitions (|0) ^ |1))|0), (|0) ^ |1))|1), |0)(|0) ^ |1)) and 
|1)(|0) <-> |1)) respectively. 



We define some frequently used gates based on the discussion in Sections |2^ and p. 2 . 
The single qubit rotation e~*2°'''' is denoted by Rr^{6). We denote rotations of 7r/2 along 
the X and y axes for spins a and b by Rxa^ Rya, Rxbi and Ryh with respective matrix 
representations e"*^'^^®^, e"'^'^^®'^, e~*f^®'^^, and e"*^^'^'^^ . The rotations in the reverse 
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directions are denoted by an additional "bar" above the symbols of the original rotations, 
such as Rxa- 

In the respective rotating frames of the spins (tracing the free precession of the uncoupled 
spins), only the J-coupling term, e~*~'^^'^'^^, is relevant in the time evolution. In this 
chapter, an evolution of duration t = 27 is denoted by r. It corresponds to the evolution 
g-i4 0-z®o-^ (^2Zab in the previous chapter). 



Recall from Section 3.2.3 that if the density matrix is X]jj=o ^ij^i ® '^j the onset of 



the measurement, four spectral lines at frequencies 
obtained, with corresponding peak integrals: 



J Ula J <^b 



27r 2 ' 27r 2 ' 27r 2 ' 27r 2 



4, can be 



^high 



h 



high 



h,. 



i{ciO - Cis) + C20 - C23 
«(C10 + Cls) + C20 + C23 
«(C01 - C31) + C02 - C32 

i{coi + C31) + C02 + C32 



(8.24) 
(8.25) 
(8.26) 
(8.27) 



Note that the expression cio — C13 occurring in the high frequency line of spin a is the 
coefficient of ax ® |1)(1| in p(0); cio + C13 in the low frequency line of spin a is the coefficient 
of (Ta, 10)(0|. Likewise, C20 — C23 is the coefficient oi ay®\l){\\ andc2o + C23 is the coefficient 



of fJ,, 



These quantities signify the transitions |0) ^ |1) for spin a conditioned on 



spin h being in |1) or |0). Similar observations hold for the high and low lines of spin h (see 
Fig. P). 

Generalized amplitude damping is much slower than phase damping in our system so 
that unitality is a good approximation. When the evolution of the density matrix is given 
by £, the effective evolution of the deviation is given by (see Section |6.3.1j) 



PA ^ v{£{I) - I) + £{p^) . 



i.28) 



The second term in Eq. ( |8.28| ) represents the result of applying £ to the deviation pA when 
neglecting the identity, and the first term is the correction due to non-unitality. In our 
experiment, £{I) — I is small compared to other effects and can be treated as a small extra 
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distortion of the state when making the unitahty assumption. 



8.3 Two-bit code in NMR 



We now describe how the two-bit code experiment can be implemented in an ensemble of 



two-spin systems. Modifications of the standard theories in Section 8.2.2 are needed. These 



include methods for state preparation, designing encoding and decoding pulse sequences, 
methods to store the qubit with controllable phase damping, and finally methods to read 
out the decoded qubit. Fidelity measures for deviation density matrices are also defined. 

Spins a and b are designated to be the input and the ancilla qubits respectively. The 
output states of spin a are reconstructed from the peak integrals at frequencies a;a/2vrib J/2. 



Fig. 8.4 schematically summarizes the major steps in the experiments, with details given in 
the text. 



control 
experiment 



+ 



Expt4 



Expt3 



CN 



coding _|_ 
experiment 



Expt2 



Exptl 



CN 



Pure state 
preparation 



R 



■ya\ 



R 



■ya\ 



R^ 



■ya\ 



R^ 



■ya\ 



Input state 
preparation 



Encoding 



PD 



PD 



PD 



PD 



Phase 
damping 



Udec 



Ude 



Decoding 



Rx 



Rx 



Rx 



Rx 



Pth 



Pi 



P2 



P3 



PA 



P5 



Figure 8.4: Schematic diagram for the two-bit code experiment. CN is used to prepare 
the initial state. Rya{0) is a variable angle rotation applied to prepare an arbitrary input 
state, which is then subject to phase damping (PD). In the coding experiment, encoding 
and decoding operations, Uenc and Udec-, are performed before and after phase damping, 
whereas in the control experiment, these operations are omitted. Rxa is used as a readout 
pulse on spin a to determine the output state p^ in spin a. pi corresponds to \ipi) or pi in 



Fig. 8.2. Details are described in the text. 



Some notation is defined as follows. Initial states and input states refer to po and pi in 



Fig. p.4| . The phrase "ideal case" refers to the scenario of having perfect logic operations 
throughout the experiments and pure phase damping during storage. 
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Initial state preparation 



It is necessary to initialize spin b to |0) before the experiment. This can be done with 
temporal labeling. The idea is to add up the results of a series of experiments that begin 
with different preparation operations Pk before the intended computation. The result is 
equivalent to performing the computation on the initial state J2k PkPthPl- To prepare spin 
h in an effective pure state, only two experiments are needed: the first experiment starts 
with no additional pulses; the second experiment starts with CN (Fig. |6.2| ) which acts as 
CNOTfea on the thermal state. The equivalent initial state is pth + Pen (symbols are as defined 
in Section |6.2.5|) : 



















-U)a + 

-Wf, 



uJa 0-^(8) |0)(0| +ujbl(2)a^ 



^.29) 



The first term in Eq. ( S.29| ) is the desired initial state. The second term cannot affect the 
observable of interest, the spectral lines at u;a/27r, because of the following. The identity 
in o is invariant under the preparation pulse Rya{9). The input state is thus the identity, 
which has no coherence to start with. Therefore, the output state after phase damping in 
both the control and the coding experiment is still the identity. This is non-trivial in the 
coding experiment. However, inspection of Eqs. ( |8.18| )- (|8.21| ) shows that spin a is changed 
at most by a phase in the coding experiment. While Eqs. ( ^.18| )-( |8.2l| ) apply only to the 
case when h starts in |0), the result can be generalized to any arbitrary diagonal density 
matrix in h (proof omitted). It follows from Eqs. ( |8.24 )-( |8.27| ) that the second term is not 
observable in the output spectral lines of a. 

In contrast, the input state in spin a can be a mixed state as given by the first term in 



Eq. (8.29), since the phase damping code is still applicable. Different input states can be 
prepared by rotations about the y-axis of different angles 9 £ [0, vr] to span a semi-circle in 
the Bloch sphere in the xz-plane. Due to the axisymmetry of phase damping (Eq. (p.2|)), 
these states suffice to represent all the states to test the code. 
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We conclude with an alternative interpretation of the initial state preparation. Let the 
fractional populations of |00), |01), |10), and |11) be poOi Poi^ PWi and pu in the thermal 
state. Then, the initial state after temporal labeling is 



(poi+pu) I®\l){l\+2pw I®\0){0\ + 2(poo -Pio) |00)(00| 



^.30) 



where the identity term is not omitted, unlike Eq. ( 8. 29] ). Temporal labeling serves to 
randomize spin a in the first term in Eq. ( 8.3C| ) when b is |1). We have shown previously 
that the identity input state of a is preserved throughout both the coding and the control 
experiments in the ideal case. Consequently, only the last term in Eq. ( ^.30| ) contributes 
to any detectable signal in all the experiments, and we can consider the last term as the 
initial state. 

Having justified both pictures to identify the first term in Eq. (|8.29| ) and the last term 
in Eq. ( ^.30| ) as the initial state, both pictures will be used throughout the discussion. 



Encoding and decoding 

The original encoding and decoding operations are composed of the Hadamard transforma- 



tion and CNOT^a, as defined in Section 3.2. The actual sequences can be simplified and are 



shown in Fig. 8.5 
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Figure 8.5: Pulse sequences to implement the encoder Uenc and the decoder Udec- Time 
runs from left to right. 



The operator Uenc can be found by multiplying the component operators in Fig. 8.5 
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The encoded states are slightly different from those in section g.2.2 



|0l> = -^(|00)+i|ll)) 
\W = ^(i|01) + |10)), 



^.32) 
^.33) 



but the scheme is nonetheless equivalent to the original one. The decoding operation C/^ec 
is given by 
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^.34) 



The possible decoded outputs are the same as in Section B.2.2 except for an overall sign in 
the single error cases. 



Storage 

The time delay between encoding and decoding corresponds to storage time of the quantum 
state. During this delay time, phase damping, amplitude damping and J-coupling occur 
simultaneously. How to single out the effects of phase damping during storage is explained 
as follows. 

First of all, the time constants of amplitude damping, Ti's, are much longer than those 
of phase damping, r2's. Storage times are chosen to satisfy td < T2 <^ Ti. This ensures 
that the effects of amplitude damping are small. 

The remaining two processes, phase damping and J-coupling, can be considered as 
independent and commuting processes in between any two pulses since all the phase damping 
operators commute with the J-coupling evolution exp(— i ct^C^kt^ 7rJtd/2). We choose Jt^ 
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to be even integers to approximate the identity evolution. As J is known with hmited 
accuracy, we add refocusing vr-pulses [^] to spin b (about the y-axis) in the middle and at 
the end of the phase damping period to ensure trivial evolution under J-coupling. These 
pulses flip the z axis for b during the second half of the storage time so that evolution in 
the first half is always reversed by that in the second half. In this way, controllable amount 
of phase damping is achieved to good approximation. 

Control Experiment 

For each storage time t^, input state, and temporal labeling experiment, a control experi- 
ment is performed with the coding and decoding operations omitted. Since phase damping 
and J-coupling can be considered as independent processes, and J-coupling is arranged to 
act trivially, the resulting states illustrate phase damping of spin a without coding. 



Output and readout 

For an input state prepared with Rya{0), the state after encoding, dephasing and decoding 



(p5 in Fig. |8.4|) is derived in Appendix |C.2| and is given by Eq. (C.7) (from now on, uia is 



omitted): 



„coded 



COS {1-Pa-Pb + "iPaPb) CTz + sin 9 {I - Pa - Pb) Ox ® (J + 0-^)/2 

-h COS Q {pa +Pb- '^PaPb) cTz + siu 9 {-pa + Pb) CTx {I -az)/2. (8.35) 



In the control experiment, the corresponding output state is given by Eq. (p.2|) : 



control 
Pb 



cos9 Uz + {I - 2pa)s\ii9 Ux ® (I + az)/2. 



^.36) 



The initial state used in the derivation of Eq. ( |8.36| ) is the first term in Eq. ( ^.29 ), and the 
encoding and decoding operations are as given by Eqs. (|8^ and (|8^ ). 

In the ideal case, the output state can be read out in a single spectrum. Recall that 
the coefficients of —ay ® [I ^ Gz) and —ax ® (/ =b cr^) are the real and imaginary parts of 
the low and the high frequency lines of spin a. Therefore, the coefficients of —az (/ =b o"^) 
and —ax (8) (/ ± cxz) in p<^'^'^<^ and can be read out as the real and imaginary parts 
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of the low and the high frequency hnes of spin a, if Rxa is appUed before acquisition. This 
pulse transforms the z-component of spin a to the y-component leaving the x-component 
unchanged, as described in Section Note that only states with spin b being |0) (|1)) 
contribute to the low (high) frequency line. Therefore, in the coding experiments, the 
accepted (rejected) states of a can be read out separately in the low (high) frequency line. 
There are no rejected states in the control experiments. 

The rest of the chapter makes use of the following notation. "Output states" or "ac- 
cepted states" refer to the reduced density matrices of b before the readout pulse, and are 
denoted by p™'^^"' = b(0|pg°"'^^|0)b and = b(0|pg°'^*™'|0)fe. Rejected states refer to 

b{l\p'^°'^^'^\l)h from the coding experiments. 

The accepted and rejected states for a given input as calculated from Eq. ( 8.35| ) and 
Eq. ( |8.36| ) are summarized in Table 8.1 . 
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il-Pa-Pb + '^PaPb) COS 6 
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rejected state 


(Pa +Pb- ^PaPb) COS 9 


{-Pa+Pb) sine 


control expt.: 






accepted state 


COS 9 


{l-2pa)sin9 


rejected state 









Table 8.1: Input and output states of spin a in the coding and the control experiments. 



The output states p^°'^^'^ and p^^'^^'^''^ as predicted by Table 0, are plotted in Fig. ^ 
in the xz-plane of the Bloch sphere of spin a. The north and south poles represent the 
Bloch vectors ±5 (|0) and |1)). The time trajectories of various initial states are indicated 
by the arrows. The Bloch sphere is distorted to an ellipsoid after each storage time. We 
concentrate on the cross-section in one half of the xi-plane, and call the curve an "ellipse" 
for convenience. The storage times plotted have equal spacing and correspond to Pa = 
0,0.071,0.133,0.185,0.230,0.269. For each ellipse, is chosen to be the same as pa- The 
main experimental results will comprise of information of this type. 
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Figure 8.6: Predicted output states (a) with or (b) without coding. The arrows indicate the 
direction of time and the ehipses represent snapshots of the original surface of the Bloch 
sphere. 



Fidelity 

One can quantify how well the input states are preserved using various fidelity measures. In 
classical communication, the fidelity can be defined as the probability of successful recovery 
of the input bit string in the worse case. In quantum information processing, when the 
input Pin is pure, the above definition generalizes to the minimum overlap fidelity (see also 
Eq. O)), 

= minp^^TripoutPin) ■ (8.37) 



We emphasize that Eq. (|8.37) applies to pure input states only. The reason why Eq. (8.37) 



is sufficient for our purpose will become clear later. 

When Pin and pout are qubit states of unit trace with respective Bloch vectors fin and 
font, Eq. ( 8.37| ) can be rewritten as 



= miuf^^^il + fin ■ rout) ■ (8.38) 
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From Eq. ( pl^ ) for phase damping, when fin = {rx,ry, Vz), font = (e ^^^x, e ^^ry,rz). There- 
fore, 

hn-rout = e-^\rl+rl)+rl (8.39) 

= -2p{rl + rl) + l, (8.40) 

where we have used the fact |rinP = 1 for pure states and p = {1 — e~'^*)/2. The minimum 
in Eq. ( ^.38| ) is attained for input states on the equatorial plane with r'^+Vy = 1. Therefore 

J^ = l-p= ^(1 + e-^*). (8.41) 

With coding, the accepted state is (see Eqs. ( ^.18D -( ^l2l| )) 

pcoded ^^^^ _ ^^^p,^ ^ PaPbazpin(yz ■ (8.42) 



If one considers the conditional fidelity in the accepted state, pout in Eq. ( 8.371 ) should be 
taken as the post measurement density matrix. 



^coded ^coded 
Ha Ha 



Pout T^.f^lcoded^ (1 _ + p^p^ (8-43) 

{l-PaPb)Pin+PaPbCrzPinCrz- (8.44) 



Note that the above expression is identical to the expression for single qubit phase damp- 
ing but with error probability p = PaPb- Therefore, coding changes the conditional error 
probability to second order, and the conditional fidelity is improved to J^c = 1 ~ PaPb- 

The amount of distortion can also be summarized by the ellipticities of the "ellipses" 
that result from phase damping. The ellipticity e is defined to be the ratio of the major 
axis to the minor axis. Without coding, the major axis remains unchanged under phase 
damping, and the minor axis shrinks by a factor of e~^*, therefore e = e^*. Using Eq. ( |8.41 ), 

^=i(l + i). (8.45) 

With coding, J^c is given by the same expression on the right hand side of Eq. ( ^.45|) . In the 
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ideal case, the overlap fidelity and the ellipticity have a one-to-one correspondence. In the 
presence of imperfections, the overlap fidelity and the ellipticity, one being the minimum of 
the input-output overlap and the other being an average parameter of distortion, are more 
effective in reflecting different types of distortion. 

We now generalize to new definitions of fidelity for deviation density matrices for the 
two-bit code. In NMR, quantum information is encoded in the small deviation of the state 
from a completely random mixture. The problem with the usual definitions of fidelity is 
that they do not change significantly even when the small deviation changes completely. 
This is true whether the fidelities are defined for pure or mixed input states. To overcome 
this problem, we introduce the strategy of identifying the initial excess population in |00) as 
the pure initial state so that usual definitions of fidelity for pure input states are applicable. 
This improves the sensitivity of the fidelity measures and provides a closer connection to 
the pure state picture. 



The initial state in Eq. (8.30) can be rewritten as 

p = aff""'^ + (1 - a)/?"''^"* , (8.46) 
where a = 2(poo — Pw) = ^'^a/2A;fiT, and 

fP^^^ = |00)(00| (8.47) 



quiet 



1 



1 - a 



(m+Pii) +2pio /^|0)(0| . (8.48) 



It has already been shown that p?"**^* is irrelevant to the evolution and the measurement of 
pP"^<^ when all processes are unital. Therefore p?™*^* is neglected and the small signal result- 
ing from the slow non-unital processes will be treated as extra distortion to the observable 
component. The input state prepared by Rya{0) can be written as 



For the state change pin £{pin), we consider the overlap between /of"'^^ and <f(/?f, 
in place of the overlap between pin and £{pin). This defines a new overlap fidelity J^j\ 



m ) 
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min^piire Tr(pf^^'^<5(/9f^''^)) = rainf.^ ^(1 + • rout) similar to the pure state case. 

J^A can be calculated from the experimental results in the following manner. The 
measured Bloch vector of a, fm, is proportional to that defined by b(0|iS(pf^'"'^)|0)f,. Due 
to limitations in the measurement process, this proportionality constant d is not known 
a priori. However, when = in the control experiment, £{f/^^^^) = p\Y'^ and fm = ar^. 
Therefore, a = |fV;,|£i=o can be determined. In other words, |fm|0=o is used to normalize all 
other measured output states before using the expression for .Fa. 

The expression for can also be used for the conditional fidelity in the coding exper- 
iment if the post-measurement accepted output state is known. This requires Tr(p™'^'^'^) = 
— Pa — Pb + '^PaPb) to be determined for each storage time. The correct normalization is 
again given by the output at = 0, which equals fvi = Tr(pJ^°'^'^°')Q!fj„. 

In summary, each ellipse obtained in the coding and the control experiment is normalized 
by the amplitude at 6* = 0: 



J^A = min,;- — 



1 + 



\rm{0 = 0)1 



^.50) 



It is interesting to note that in contrast to the fidelity measure, the ellipticity measure 
naturally performs an equivalent normalization, and thus can be used for deviations without 
modifications. 

We now turn to the experimental results, beginning with a description of our apparatus. 



8.4 Apparatus and experimental parameters 



We performed our experiments on carbon-13 labeled sodium formate (CHOO~Na+) (Fig. 8.7) 
at 15°C. The nuclear spins of proton and carbon were used as input and ancilla respectively. 
Note that the system was heteronuclear. The sodium formate sample was a 0.6 milliliter 
1.26 molar solution (8:1 molar ratio with anhydrous calcium chloride) in deuterated water 
P]. The sample was degassed and flame sealed in a thin walled, 5mm NMR sample tube. 



The time constants of phase damping and amplitude damping are shown in Table 8.2 



The fact T2 <C Ti ensures that the effect of amplitude damping is small compared to 
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.13 
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o 



Figure 8.7: ^^C-labeled formate. The nuclear spins of the neighboring proton and carbon 
represent qubits a and b. 

phase damping. The experimental conditions are chosen so that proton and carbon have 
almost equal r2's. This ehminates potential bias caused by having a long-lived ancilla when 
evaluating the effectiveness of coding. This also realizes a common assumption in coding 
theory that identical quantum systems are available for coding. Subsidiary experiments 



with qubits having very different T2's are described in Appendix C.3 





Ti 


T2 


13c 


9 s 
13.5 s 


0.65 s 
0.75 s 



Table 8.2: Ti's and T2's for CaCl2-doped formate at 15°C, measured using standard 
inversion recovery and Carr-Purcell-Meiboom-Gill pulse sequences respectively. 



Phase damping arises from constant or low-frequency non-uniformities of the "static" 
magnetic field which randomize the phase evolution of the spins in the ensemble. Several 
processes contribute to this inhomogeneity on microscopic or macroscopic scales. Which 
process dominates phase damping varies from system to system ||l^. For instance, in- 
termolecular magnetic dipole-dipole interaction dominates phase damping in a solution of 
small molecules, whereas the modulation of direct electron-nuclear dipole-dipole interac- 
tions becomes more important if paramagnetic impurities are present in the solution. For 
molecules with quadrupolar nuclei (spin > 1/2), modulation of the quadrupolar coupling 
dominates phase damping. Other mechanisms such as chemical shift anisotropy can also 
dominate phase damping in other circumstances. These microscopic field inhomogeneities 
have no net effects on the static field when averaged over time, but they result in irre- 
versible phase randomization with parameters intrinsic to the sample. Another origin of 
inhomogeneity comes from the macroscopic applied static magnetic field. In contrast to 
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the intrinsic processes, phase randomization due to this inhomogeneity can be reversed by 
applying refocusing pulses as long as diffusion of molecules is insignificant. 

Phase damping caused by the intrinsic irreversible processes alone has a time constant 
denoted by T2, while the combined process has a shorter time constant denoted by T|. T2 



is measured by the Carr-Purcell-Meiboom-Gill |50| experiment using multiple refocusing 
pulses. r| can be estimated from the line-width of the NMR spectral lines: during acquisi- 
tion, the signal decays exponentially due to phase damping, resulting in Lorentzian spectral 
lines with line- width l/vrT^ . 

In our experiment, T|'s for proton and carbon were estimated to within 0.05 s to be 
~ 0.35 and 0.50 s. The storage times were approximately 0, 62, 123, 185, 246, 308 ms 
{n/J for n = 0,12,24,36,48,60). The maximum storage time was 120 r, long compared 
to the clock cycle and was comparable to T|. The decay constant A, defined in Section 
S.2.1| , was given by A = l/T^. The error probabilities after a storage time of td were 
Pi = (1 — exp(— trf/T|j))/2 for spins i = a,b. To reconstruct the ellipse for each storage time, 
11 experiments were run with input states spanning a semi-circle in the xz-plane. Each 
input state was prepared by a Rya{0) pulse with = nvr/lO for n = 0, 1, • • • , 10. 

All experiments were performed on an Oxford Instruments superconducting magnet of 
11.7 Tesla, giving precession frequencies of aJa/2'/r ~ 500 MHz for proton and L<Jb/27r ~ 
125 MHz for carbon. A Varian ^^^'^ Inova spectrometer with a triple-resonance probe was 
used to send the pulsed RF fields to the sample and to measure the FID's. The RF pulses 
selectively rotated a particular spin by oscillating on resonance with it. The 7r/2 pulse 
durations were calibrated, and they were typically 8 to 14 /us. To perform logical operations 
in the respective rotating frames of the spins, reference oscillators were used to keep track 
of the free precession of both spins, leaving only the J-coupling term of 195.0 Hz in the 
time evolution. Each FID was recorded for ~ 6.8 s (until the signal had faded completely). 
The thermal state was obtained after a relaxation time of 80 s (S> Ti's) before each pulse 
sequence. 

Using the above apparatus and procedures, we performed the experiments as outlined 
in Section p.3|. The experimental results are described in the next section. 
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8.5 Results and discussion 



8.5.1 Decoded Bloch spheres 

The output states, p™*^^*^ and /9jj°"*''°', obtained as described in Sections ^]3| and |8.4| , and 
the analysis that confirms the correction effects of coding, are presented in this section. 

Figure shows the accepted states in the xi-plane of the Bloch sphere of spin a. p'^°'^^'^ 
and plotted in Figs. ^.8| a and b. The ehipse for each storage time is obtained 

by a least-squares fit described later. 
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Figure 8.8: (a) Experimental data (circles) showing the output states from the coding 
experiment. Each ellipse (solid line) corresponds to one storage time and is obtained by a 
least-squares fit (Eq. ( 8.56| )) to the data. The storage times are n x 61.5ms for n = 0, 1, • • • , 5, 
and shrinking ellipses correspond to increasing n. (b) Experimental data (crosses) and fitted 
ellipses (solid lines) for the control experiment. A replica of figure (a) is plotted in dotted 
lines for comparison. In both figures, uncertainties in the data are much smaller than the 
circles and crosses. 



The most important feature in Fig. is the reduction of the ellipticities of the ellipses 
due to coding, which represents partial removal of the distortion caused by phase damping 
- the signature of error correction. Coding is effective throughout the range of storage times 
tested. 

We quantify the correction effects due to coding using the ellipticities. When deviations 
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from the ideal case such as offsets of the angular positions of the points along the ellipses 
and attenuation of signal strength with increasing 9 exist, the minimum overlap fidelities 
and the ellipticities are no longer related by Eq. ( 8.45| ). Since the ellipticity is an average 



measure of distortion which is less susceptible to scattering of individual data points, we 
first study the ellipticities. Moreover, since the deviations from the ideal case are small, 



we can still infer the fidelities from the ellipticities using Eq. ( g.45 ). A discussion of the 
discrepancies and the exact overlap fidelities will given later. 

Ellipticities In the ideal case, the ellipticity for each ellipse can be obtained experimentally 

as 

where I denotes the intensity (amplitude square) of the peak integral. I is given by the x 
and z-components of the output states as 



I = rl + rl (8.52) 



In the ideal case, I {9) can be found from Table 8.1 



IcontroliO) = l-A{pa-pl)sm^e (8.53) 

'^coded{0) = {I - Pa - Pb + '^PaPbf - '^PaPbi'^ - Pa - Pb + PaPb) Sin^ d , (8.54) 

and both are of the functional form 

I^deal{0) = A + Bsm^9. (8.55) 



Experimentally, the output Bloch vectors do not form perfect ellipses. We modify Eq. ( 8.55 ) 
to include signal strength attenuation with increasing 9 and constant offsets in the angular 
positions: 

lexpie) = iA + Bsm\9 + D)){l-Ci9 + D)), (8.56) 
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and perform non-linear least-squares fits of the experimental data to determine A, B, C, and 



D. The fitted ehipses plotted in Fig. 3. J follow from Eq. (|8.56|) and the fitted parameters. 



The ellipticities e are found using Eq. ( 8.51 ) by interpolating the intensities at 9 = 



and 6 = ^. The ellipticities are plotted in Fig. |8.9| a. The uncertainties of the fitted 
parameters originate from the uncertainties in the data, which are estimated to be ~ 1% 
for the amplitude and 1.5 degrees for the phase in the measured peak integrals. These 



uncertainties are propagated numerically to the ellipticities as plotted in Fig. 8.9 a. Ideal 
case predictions and simulation results are also plotted in Fig. |8.9| a. The simulation takes 
into account the major imperfection in the pulses and will be described later. 
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Figure 8.9: (a) Ellipticity and (b) inferred fidelity as a function of the storage time in the 
coding and the control experiments. Error bars represent 95% confidence level. 
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Error correction The effectiveness of coding to correct errors is evident when comparing 
the ehipticities from the coding and the control experiments (Fig. |8.9| a). Without coding, 
the ehipticity grows exponentiahy as e^'^-^'^^a (for fitted to be ~ 0.4s). With coding, 
the growth is slowed down, with almost zero growth for small t^- The suppression of linear 
growth of the ellipticity can be further quantified by weighted quadratic fits e = co + cif^ + 
C2t^ to the ehipticities. For the control experiments, cq = 1.00 ± 0.01, ci = 1.31 ± 0.21 and 
C3 = 8.8 lb 0.8 whereas for the coding experiments, cq = 1.10 it 0.02, ci = —0.24 it 0.29 and 
C3 = 3.8 lb 0.9 {td in seconds). Therefore, the linear term "vanishes" due to coding. The 
small negative coefficient for the linear term originates from the scattering of the data point 
at zero storage time. 

To quantify the "cost of the noisy gates" caused by the imperfect pulses, we compare 
the ehipticities from the coding experiments and from the ideal case, the quadratic fits of 
which are respectively e^°^f = 1.10 - 0.24*^ + 3.80t^ and e'r^f^f = 1.00 + O.lbtd + 2.50t^. 
The imperfections cause the ellipticity to increase by 0.1 at = and this extra distortion 
decreases with 1^. We take advantage of the fact that the simulation results are close to 
the data points but are not as scattered to have a better estimate of this "cost of the noisy 
gates". The simulation data can be fitted by e'^J^^'^ = 1.06 + 0.32*^ + 2.47t^. Compared to 
the ideal case, the coding operations increase the ellipticity by « 0.06 at td = 0, and this 
extra distortion remains almost constant for all td- 

The error probabilities as inferred from the ehipticities Pe = 1 — J-e = ^(l"^) ^-re 
plotted in Fig. b as a function of storage time. 

Error correction is also manifest by expressing in the coding experiment as a function 



of the original Pe in the control experiment, as plotted in Fig. 3.1C. The quadratic fit to 
the experimental results gives Pe°p^'^ = cq + cip + C2p'^ where p stands for in the control 
experiment, cq = 0.047 ± 0.008, ci = -0.05 ± 0.12 and C2 = 1.38 ± 0.40. Therefore, the 
expected improvement p — > PaPb is confirmed. Experimentally, the error probabilities are 
larger than in the ideal case by at most 4.7% and these extra errors decrease with p. The 
quadratic fit to the simulation results (which is a good approximation of the experimental 
data) gives Pg"^'^ = 0.032 — 0.032p + 1.783p^ and differs from the ideal case by a constant 
amount of ~ 0.033 it 0.003 for all p, which represents the cost of the noisy gates. In 
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conclusion, coding with noisy gates is still effective in our experiments, even though the 
noisy gates add a constant amount of distortion. 

0.18 1 , , , , ^ , , , 1 

0.16- I 1 




0.05 0.1 0.15 0.2 0.25 0.3 

Error probability p, witliout coding 

Figure 8.10: Error probabilities in the coding experiments vs the corresponding values in 
the control experiments. Error bars represent 95% confidence level. The 45° line is plotted 
as a dotted line. 

8.5.2 Discrepancies 

While the data exhibit a clear correction effect, there are notable deviations from the ideal 
case. First, the ellipses with coding are smaller than their counterparts without coding. 
This is most obvious when the storage time is zero, in which case the coding and the 
control experiments should produce equal outputs. Second, the signal strength is attenuated 
with increasing 9 relative to ideal ellipses. Third, although the data points are well fitted 
by ellipses, their angular positions are not exactly as expected ( "^-offsets" ) . Finally, the 
spacings between the ellipses deviate from expectation. The causes of these discrepancies 
and their implications on error correction are discussed next. 

Gate imperfections: RF field inhomogeneity The major cause of experimental errors 
is RF field inhomogeneity, which causes gate imperfections. This was determined by a series 
of experiments (details of which are not given here), and a thorough numerical simulation. 



8.5. RESULTS AND DISCUSSION 



191 



as described below. The physical origin of the problem is as follows. The coil windings 
produce inhomogeneous RF fields that randomize the angles of rotation among molecules. 
For a single rotation, the signal averaged over the ensemble decreases exponentially with 
the pulse duration to good approximation. A measure of the RF field inhomogeneity is the 
signal strength after a tt/2 pulse. They are measured to be ~ 0.96 and 0.92 for proton and 
carbon respectively. In other words, a single tt/2 pulse has an error of ~ 4-8%. 

RF field inhomogeneity affects our experiments in many ways. First, it attenuates the 
signal in both the coding and the control experiments, but the effects are much more severe 
in the coding experiments which have eight extra pulses. For instance, when the storage 
time is zero, the two experiments should have identical outputs, but the ellipse in the coding 
experiment is actually 5-15 % smaller. Second, for each ellipse, attenuation increases with 
9 as the preparation pulse Rya{9) becomes longer. 

The effects of the RF field inhomogeneity are complicated, because the errors from 
different RF pulses are correlated, and the correlation depends on the temporal separation 
between the pulses and the diffusion rate of the molecules. The correlation time of the 
RF field inhomogeneity is comparable to the experimental time scales. For this reason, 
predictions of the effects of RF field inhomogeneity are analytically intractable. 

Numerical simulations were performed to model the dominant effects of RF field inho- 
mogeneity. We followed the evolution of the states assuming random RF field strengths 
drawn from Lorentzian distributions (also known as Cauchy distributions) with means and 
standard deviations matching pulse calibration and attenuation for the 7r/2 pulses. All 
parameters in the simulation, including T|'s, were determined experimentally without in- 
troducing any free parameters. As the exact time correlation function for the errors was 
unknown, except for numerical evidence of a long correlation time, we assumed perfect 
correlation in the errors. The simulated ensemble output signal was obtained by Gaus- 
sian integration with numerical errors bounded to below 1.5%. The results were shown in 



Fig. 8.11 



Besides phase damping and error correction effects in the data, the simulations also 
reproduce extra signal attenuation in the coding experiments. The ellipticities obtained 
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Figure 8.11: Simulated output states, plotted similarly as in Fig. 3^. The simulation results 
are fitted by ellipses similar to those of the experimental data. 



from simulations (see Fig. 3^) approximate the experimental values very well. Such agree- 
ment to experimental results is surprising in the absence of free parameters in the model. 
Simulation results allow the discrepancy between the observed and the ideal ellipticities to 
be explained in terms of the RF field inhomogeneity and allow the "cost of coding" to be 
better estimated to be the constant 6 % increase in ellipticity or the ~ 3 % increase in error 
probabilities. 

The simulation results also predict increasing attenuation with 9. From the fitted pa- 
rameter C (see Eq. ( |8.56| )), the amplitudes at 9 = tt are ~ 4% weaker than the corresponding 
values at = in the simulations. Experimentally, this attenuation increases from ~ 8 to 
15% (as storage time increases from to 308 ms) in the control experiments, and remains 
~ 8% in the coding experiments. Therefore, RF field inhomogeneity contributes to the 
attenuation but only partially. 

We conclude that RF field inhomogeneity as we have modeled explains the diminished 
signal strength in the coding experiments. The simulation quantifies the "cost of the noisy 
gates". RF field inhomogeneity also explains part of the attenuation with increasing 9. We 
can also conclude that other discrepancies not predicted by the simulations are not caused 
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by RF field inhomogeneity and these discrepancies are described next. 

Other discrepancies The simulation results show that RF field inhomogeneity does not 
explain why the attenuation at large 9 increases with storage time without coding, and it 
does not explain the ^-offsets along the ellipses and the unexpected spacings between them. 

The increased attenuation with storage time at large 6 can be caused by amplitude 
damping. A precise description of amplitude damping during storage is out of scope and 
we consider only the dominant effect predicted by a simple picture. Phenomenologically, 
the loss of energy to the lattice is described in the NMR literature by 



where z{oo) = 1 is the thermal equilibrium magnetization. As z{0) = ±1 at 9 = and 
vr, we expect no changes at 9 = 0, but expect \z\ at = vr to decrease by — 7 % for 
^ — 308 ms and Ti ~ 9 s for proton. Note that refocusing does not affect spin a in the 
control experiments Q but it swaps \0l) and halfway during storage, symmetrizing 
the amplitude damping effects in the coding experiments. Therefore, we expect increased 
attenuation with storage time in the control experiments only. This matches our observa- 
tions that the attenuation of I{9 = vr) with respect to I{9 = 0) increases from 8 to 16 % 
in the control experiments, and remains 8% in the coding experiments. Moreover, earlier 
data taken without refocusing (not presented) have the same trend of increased attenuation 
with storage time in both the coding and the control experiments. These are all in accord 
with the hypothesis that amplitude damping is causing the observed effect. 

The second unexplained discrepancy is that the output states span more than a semi- 
ellipse in the coding experiment but slightly less than a semi-ellipse in the control experi- 
ment. We are not aware of any quantum process that can be a cause of it. It is notable that 
the output states and the fitted ellipticities can be used to infer the initial values of 9, and 
they are roughly proportional to the expected values for each ellipse. The proportionality 
constants are 5 — 8 % higher than unity in the coding experiment, and — 1.6 % lower in 
the control experiments. Moreover, similar effects are observed in many other experimental 
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runs. Therefore, this is hkely to be a systematic error. 

We have no convincing explanation for the anomalous spacings between the ehipses in the 
experiment. However, from the fact that all the data points belonging to the same ellipse are 
well fitted by it, the anomalous spacings are unlikely to be caused by random fluctuations on 
the time scales of each ellipse-experiment. The effect of the anomalous spacings is reflected 
in the scattering of the ellipticities of the data, and the large uncertainties in the quadratic 
fits. 

While it is impossible to eliminate or to fully explain these imperfections, it is possible 
to show that the deviations cannot affect the conclusion that error correction is effective. 

Effects of the discrepancies We now consider the effects of the discrepancies on the 
ellipticities and the inferred fidelities in the experiments. First of all, radial attenuation of 
the signal due to RF field inhomogeneity does not affect the ellipticities nor the inferred 
fidelities (taken as conditional fidelities). Second, different expressions for the "ellipticity" 
are not equivalent when the output states do not form perfect ellipses. However, they differ 
by no more than 7 and 3 % in the control and the coding experiments. 0-offsets along the 
ellipses are not reflected in the ellipticities, resulting in overestimated inferred fidelities. This 
is bounded by 3%. The scattering of the ellipticities due to anomalous spacings between the 
ellipses is averaged out with curve-fits to the data. The most crucial point is, none of these 
effects have a dependence on the storage time that can be mistaken as error correction. 
Therefore, the effects of error correction can still be confirmed in the presence of all these 
small discrepancies. 

8.5.3 Overlap fidelity 

The two previous subsections dealing with the ellipticities provide an analysis of the global 
performance of the code. A stricter analysis is provided in this section using the overlap 
fidelities given by Eq. ( |8.5[1| ) in Section The minimal overlap reflects all defects and 
deviations hidden in the ellipticities as well as other distortions such as that caused by 
amplitude damping. 
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All measurements are normalized using the amplitudes at = as described in Sec- 



tion 8.3. In the control experiments, the output states at 6 = are least affected by 
amplitude damping and RF field inhomogeneity. Therefore, the normalization can be done 
accurately. In the coding experiment, the signal attenuation at 6 = due to RF field 
inhomogeneity can lead to overestimated fidelities. We determine the uncertainties due to 
RF field inhomogeneity by the following method. For each storage time, the amplitudes of 
the accepted and the rejected states at ^ = are summed. The sum is compared with the 
corresponding amplitude at ^ = in the control experiment to estimate the attenuation 
due to RF field inhomogeneity. The effects on the overlap fidelities are bounded to below 
2%. The errors in the measured peak integrals are propagated to the fidelities which result 
in standard deviations no more than 0.7%. We apply similar procedures to the simulation 
results. The net error probabilities, given by 1 — ^ for the control and 1 — J-c for the coding 
experiments, are plotted in Fig. p. 12 . 

The large difference in the rates of growth of error probabilities confirms the effectiveness 
of coding even when a stricter measure of fidelity is used. 



;< X w/o coding 
o o /w coding 
simulation 



100 150 200 250 

Storage time [ms] 



Figure 8.12: Overlap fidelity as defined in Eq. (8.50). Points indicate experimental data 
and dashed lines indicate simulation results. Error bars represent 95% confidence level. 
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8.6 Conclusion 

We have demonstrated experimentally, in a bulk NMR system, that using a two bit phase 
damping detecting code, the distortion of the accepted output states can be largely removed. 
These experimental results also provided quantitative measures of the major imperfections 
in the system. The principle source of errors, RF field inhomogeneity, was studied and a 
numerical simulation was developed to model our data. Despite the imperfections, a net 
amount of error correction was observed, when comparing cases with and without coding, 
and including gate errors in both cases. 

Our analysis also addresses several theoretical questions in quantum error correction in 
bulk samples such as the fidelity measures of deviation density matrices. In the following, 
we conclude with some observations regarding quantum error correction in bulk systems, 
including syndrome measurements, the equivalence between error correction and logical 
labeling |5^, ^] (see also Section 6.5.2| ) , the applicability and advantages of detecting codes 



and some issues in signal strength. 

Projective syndrome measurements traditionally employed in the standard theory of 
quantum error correction are impossible in ensemble quantum computation. Measurements 
via the acquisition of the FID do not reduce individual quantum states and provide only 
"average syndromes". Moreover, the quantum states are destroyed after acquisition. How- 
ever, the important point is that syndrome measurement is not necessary in error correc- 



tion |92, 11 



In each molecule, the syndrome bits carry the error syndrome for that particular molecule 
after decoding. These bits can either be used in a controlled-operation to correct the er- 
ror 1^^, or in the case of a detecting code, to "logically label" the correct and erroneous 
states. Conversely, logical labeling to obtain effective pure states can be considered as error 
detection: unsuitable initial states are "detected" and are labeled as "bad" to start with. 
Both processes involve ejecting the entropy of the system to the ancilla bits. Error detection 
and logical labeling are therefore closely related concepts. 

The distinction between error correcting and detecting codes is blurred when using 
bulk systems. The objective of error correction is to achieve reliable data transmission or 
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information processing with high probabihty of success. When information is encoded in 
single systems, encoding with an error detection scheme will fail to provide reliable output 
for two different reasons: accepting an erroneous state or losing the state upon the detection 
of an error. Therefore, coding schemes capable of distinguishing and correcting errors are 
necessary to improve the probability of successful data processing. In contrast, in a bulk 
sample, a large initial redundancy exists upon preparation, and the combined signal of all 
the accepted cases forms the output. Therefore, rejection of the erroneous cases results in a 
reduction of the signal strength in the improved accepted cases without necessarily causing 
a failure. Error detecting codes thus provide a tradeoff between probability of error-free 
computation and signal strength. 

As suggested by this analysis, and in concert with our experimental results, it thus 
makes sense to use detecting codes instead of correcting codes in bulk quantum computing 
systems under certain circumstances. Fundamentally, it is valuable to be able to interchange 
resources depending on their relative costs. This is illustrated by the following simple 
example. Suppose a total pool of m qubits is available for transmission, and one just wants 
to correct for single phase flip errors of probability p. Using a three-qubit code, one would 
obtain an aggregate signal strength of m/3, with fidelity 1 — 3p^, whereas with a two-qubit 
code, the accepted signal strength would be m(l — 2p)/2, with fidelity 1 —p'^. Therefore, 
when p < 1/6, the two-qubit code performs better in this model due to its higher rate. 

Another example relevant to bulk computation arises when the encoding and decoding 
circuits fail with probability proportional to the number of elementary gates used. Although 
errors in consecutive gates can be made to cancel sometimes, this basic scenario is substan- 
tiated by our experiment, in which imperfect pulses contribute significantly to the net error. 
Assume now that we have n molecules, which are either two or three-qubit systems. Let 
us compare the performance of the two and three-qubit codes, based on the strength of the 
correct output signal. Because the correcting code requires at least three times as many 
operations as the detecting code [^], the figures of merit obtained for the two schemes are 
n(l — 3pg) and n(l — 2p){l — pg) respectively, where Pg is the gate failure probability. In 
this model, the detecting code performs better for p < Pg/{1 — Pg) due to the simplicity of 
the coding operations. 
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A third example is the case of current state NMR quantum computation at room tem- 
perature, in which the intrinsic signal strength decreases exponentially with the number 



of qubits 1 53, |2^, In this model, the initial signal strength of an effective pure state 
of m qubits is approximately of the order 2"*", and thus, for an ensemble of n molecules, 
the signal strengths of the outputs from the correcting and detecting codes are about n/8 
and n(l — 2p)/4 respectively. According to this measure of performance, the detecting code 
outperforms the correcting code for p < 1/4 {p < 0.27 in our experiments). 

If signal strength indeed decreases exponentially with m, then some interesting gener- 
alizations can be made. For arbitrary qubit errors, a t-error detecting code has distance 
d > t + 1, while a t-error correcting code has distance d > 2t + 1 fSH]. If one encodes k 



bits in /, the extra number of qubits used, / — k, satisfies the singleton bound |72, 26, 24|, 
I — k > 2d — 2. Therefore, the output signal strengths for the detecting and correcting 
codes would be approximately proportional to (1 — p/(p))/2^* and 1/2^*, where f{p) is a 
polynomial. The detecting code is thus always better asymptotically in this model 0. 

This chapter illustrates how a careful study of dynamics in bulk quantum systems can 
provide a valuable opportunity to demonstrate and test theories of quantum information 
and computation. The development of temporal, spatial, logical, and related labeling tech- 
niques opens a window allowing information about the dynamics of single quantum systems 
to be extracted from bulk systems. Furthermore, by systematically developing an exper- 
imental toolbox of quantum circuits and quantum error correcting and detecting codes, 
experiments which test multiple particle quantum behavior become increasingly accessible. 
With improvements in the initial polarization in the systems and new labeling algorithms 
which do not incur exponential signal loss |100|, and with better methods to control the 
major source of error, the RF field inhomogeneity, we believe that further study of bulk 
quantum systems will complement the study of single quantum systems, provide new in- 
sights into the dynamical behavior of open quantum systems, and further the potential for 
quantum information processing. 



Appendix A 



Master Equation 



A.l Master Equation formalism 



The dynamics of an open quantum system are traditionally described by a "master equa- 
tion" which governs evolution of the density matrix as a function of time. A master equation 
is generally derived in the following manner. The system s, described by p{t), couples to 
an environment e, described by pe, through an interaction Hamiltonian Hj. Due to the 
evolution, quantum information originally in the system is delocalized over both the system 
and the environment. Tracing over the inaccessible environmental degrees of freedom gives 
the reduced density matrix for the system alone. Assuming weak interactions and a mem- 
oryless environment (so that the Born and Markov approximations hold) the Schrodinger 



equation for the system state has the form |82 



1 



dt'Tip 



Hi{t),[Hi{t-t'),p{t)(S)pe] 



(A.l) 



where the operators are given in the interaction picture, and Tc is the correlation time of the 
environment. When Tc is much smaller than the time scales in which p{t) or Hj(t) changes 
significantly, Eq. ( |A.1| ) can be approximated to give 



pit) = 



Hi,[Hi,p{t)^ Pe 



(A.2) 
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where is a prefactor resulting from the integration. We will call Eq. ( [A.2| ) the simplified 
master equation. 

A. 1.1 Common noise processes 

We study two common noise processes, amplitude damping and phase damping, using this 
master equation approach. 

Amplitude damping 



Amplitude damping 52] describes the energy loss from the system to a zero temperature 
environment, and is a good approximation to many physical systems. It can be studied by 
modeling the system as a simple harmonic oscillator (for simplicity, we dispense with self 
Hamiltonians) . The energy exchange between the system and the environment is given by 
the interaction Hamiltonian in the Schrodinger picture: 

Hi = x{a^b + b^a) (A.3) 

where a, b are the annihilation operators of the system and the environment respectively. 
Here, a single mode (harmonic oscillator) for the environment is sufficient to model the 
dynamics of interest, x is a coupling constant. 



We can obtain a master equation by substituting Eq. (A.3) into the simplified master 



equation Eq. (|A.2|) in Section AA 



p = — — {a}ap + pa}a — 2apa}) — Xnh{a'^ap + paa^ — 2apa) — 2a^ pa) , (A. 4) 

where fih = {b'^b), and A = For amplitude damping with an environment at tem- 

perature ksT much smaller than the system's energy scale, we can set n;, = 0, resulting in 
the master equation: 

p = —^{a} ap + pa^ a — 2apa)) . (A. 5) 

Writing p{t) = J2m,n Pmn\i^){n'\ where the time dependence of pmn has been suppressed for 
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simplicity, we obtain the system of equations: 



^ [Pmnin + m) - 2y/m + iVn + lp(. 



m+l){n+l) 



\m){n\ 



(A.6) 



Equation ( |A.6| ) can be solved to obtain p{t) which is a complete description of the dy- 
namics. The result is best expressed in the operator sum representation, where p{t) = 
j:kMt)p{0)Al{t) and 



n \ 



n 



y^(l -7(t))"-^7(t)'=|n - k){n\ , 



where j{t) = 1 — e 



(A.7) 



Phase damping 

The interaction Hamiltonian is: 

Using the simplied master equation, 



(A. 



where {n'^)b = Tr pb{b^b)'^ . Substituting p{t) = Y.mn Pmn\m) {n\ 



Ypmn\m){n\ = -^^pmn{m - n)'^\m){n\ 

mn mn 



(A.9) 



(A.IO) 



which can be solved to obtain pmn{t) = e 2 (™ ") pmn(O). 
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Unital processes 



In this appendix, we investigate the possibihty to represent unital processes by random 
unitary processes. We use the linear representation of unital quantum operations, and 
consider the complete positivity of such representation. We will show that unital processes 
on a qubit can be represented as random unitaries. For a higher dimensional system, we 
will show that there are unital processes that are not random unitaries. 

The results were obtained in collaboration with A. Childs and X. Zhou (unpublished). 
Related results were independently reported in |64, 99 1. 



B.l Definitions and Facts 

Definition 9: A quantum operation £ of the form £{p) = YlkPkUkpU'l, where 
J2kPk = 1) is called a random unitary process. 



Definition 10: A quantum operation 8 is unital if = I. A trace-preserving 
unital quantum operation is called doubly stochastic. 

A random unitary process is doubly stochastic. To investigate the converse, we consider 
the affine representation of quantum operations. 
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B.2 AfRne and linear representations of quantum operations 

Since quantum operations are linear maps on density matrices, they can be represented by 
matrices and vectors respectively. These matrices and vectors are real if the basis for density 
matrices is hermitian. We consider an orthonormal hermitian basis for density matrices, 
with an element proportional to the identity, and all other elements traceless. We call the 
traceless component the generalized Block vector. When the identity component is chosen 
to be the first coordinate, the matrix representation of a quantum operation £ is given by 



Mo 










(B.l 


V2 


M 





where Mq and M are respectively 1x1 and {d? — 1) x (d^ — 1) matrices for a d-dimensional 
system, Vi and V2 are d"^ — 1 row and column vectors. If £ is trace-preserving, Mq = 1 
and Vi = 0. If f is unital, Mq = 1 and V2 = 0. It follows that £ is doubly stochastic if 
and only if Mq = 1 and the generalized Bloch vectors form an invariant subspace under 
£, in which case M governs the dynamics, and we call M a linear representation of iS. Q 
The linear representation is unique. Composition of quantum operations corresponds to 
matrix multiplication in the linear representation. A matrix represents a (doubly stochastic) 
quantum operation if and only if it defines a completely positive map. 

B.2.1 Complete positivity in the linear representation 

To-date, there is no simple characterization of complete positivity for the linear representa- 
tion. In this section, we present a characterization in the qubit case. Before the discussion on 
complete positivity, we elaborate on the distinction between positive maps and completely 
positive maps. 

non-unital trace-preserving process can be represented by an afRne map on the generalized Bloch 

vector. 
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Positivity vs complete positivity 



Recall from Section 2.3.1 that a mapping £ is completely positive, if for any ancillary Hilbert 
space R, £ <^ Zr > 0, where Ir is the identity operation on R. Physically, i? is a reference 
system. We say that £ is incompletely positive if £ is positive but not completely positive; 
in other words, when there exists a reference system R and a witness p > in the joint 
system such that {£ ®Zr){p) ^ 0. We use "CP" and "ICP" as shorthand notations for 
"completely positive" and "incompletely positive". We first consider some examples in the 
qubit case: 

• Any unitary operation is CP (since an operator sum representation exists). 

• The depolarizing channel (see Section ^ ), defined by T>p{p) = (1 —p)lpl + ^{XpX + 
YpY + ZpZ) for < p < 1, is CP. The set of depolarizing channels is closed under 
composition. Denote ^{I + f ■ a) by p(r). It follows that 



VMrl) = p{{^-^)r) , (B.2) 



and ° ^pi — where p = Pi + P2 — ^ P1P2 is always in [0, 1]. 

• The transposition, T{H) = , is ICP. Any inseparable state is a witness |91]. 

• The inversion operation, X^{p{r)) = p{—f), is ICP. It can be shown in two different 
ways. Note that = T oy = y oT where y{p) = YpY. If /) is a witness for T to be 
ICP, YpY is a witness for Toy because it is the preimage of p under 3^. Alternatively, 
p is a witness for 3^ o T because y preserves eigenvalues. 

The last example can be generalized. If A is ICP with witness p, and U is a unitary 

operation, U o A and A o U are both ICP, with respective witnesses p and pU. This 

invites the question, are £ o A and Ao £ ICP for a general quantum operation £1 

We consider the example o T>p{= Dp oZ^). As the range of T>p is smaller than the set 

of all possible states, it is unclear if all the witnesses of X^, are excluded in the range of Pp, 
An inseparable state cannot be written as a convex sum of product states over the composite system. 
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making ly o Dp CP. We now show that ly o T>p is ICP when p < ^ and is CP when p > ^. 
Suppose p < ^. Consider the Bell state \^~^) = "^(|00) + with density matrix 

= + XX -YY + ZZ). 



/I + (l - y ) {-XX + YY - ZZ) 

^11 + (i - ^) (// - XX + yy - ZZ) 



(B.4) 
(B.5) 



The minimum eigenvalue of {II—XX+YY—ZZ) is —2. Therefore, the minimum eigenvalue 
of ((T^oPp)® J^)(|$+)($+|) is -2(l-f)] = p-l <0. Hence, T„ o is ICP. 

Suppose p > 5, 

{ly o Vp){p{f)) = p (- (l - ^) rj = Vp,{p{f)) , (B.6) 



where = ^—p G [^,1]. Therefore, XyoVp is CP. It is interesting that "over-depolarization" 
(p > I ) resembles "depolarization-inversion" . 

This counter-example shows that no general statement can be made on the composition 
of a CP map with an ICP map. We now proceed to the characterization of complete 
positivity in the linear representation. 



Necessary and sufficient condition for complete positivity 



Let Pg- denote the generalized random Pauli channel (see Section |2.4D 



T^qip) = QoP + QxXpX + QyYpY + QzZpZ . 



(B.7) 



Vg' is trace-preserving iE qq + Qx + Qy + Qz = ^ and Pg- is completely positive iff > for 
i = 0, x,y, z. To see why complete positivity of Pg- implies qi > 0, consider 



(P,-^Jr)(|$+)($+|) = qo\<^+){<l>+\+q.XI\<^+){<^+\XI 

+ qyYI\<^+){^+\YI + qzZI\<i>+){<^+\ZI 



(B.8) 
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Since X/|<i>+), y/|<I>+), and ZI\^+) are orthonormal, the eigenvalues of {V^ (g) 

Xfl) )(<!>+ 1) are precisely qi for i = 0,x,y,z, which have to be non-negative if Pg- is 
completely positive. When 2?^- is trace-preserving, P^- is completely positive iff g G A^, the 
simplex with vertices (0,0,0), (1,0,0), (0,1,0), and (0,0,1). 

We now consider when a real 3x3 matrix is a linear representation for a doubly stochastic 
process choosing {I, X, iY, Z} to be the basis for density matrices. We first consider diagonal 
matrices and then general matrices. 

Lemma 4- D = Diag(di, ^2, da) is the linear representation of a doubly stochastic 
quantum operation iff (di, d2, d^) £ A^, the simplex in R'^ with vertices (1, 1, 1), 
(1,-1,-1), (-1,1,-1), and (-1,-1,1). 

Proof: It can easily be verified that D is the linear representation of a mapping 
given by Pg- in Eq. ( |B.7] ) where the parameters are related by the non-singular 
linear transformation 

di = l-2qy-2q, (B.9) 
d2 = l-2q,-2q, (B.IO) 
ds = l-2q^-2qy (B.ll) 

Pg- is completely positive iff g G Aq iff (di, ^2, (is) G A^. 

Theorem 8: Let M be a real 3x3 matrix. M is the linear representation 
of a doubly stochastic process iff M = O1DO2 for some Oi G 50(3) and 

= Diag((ii, ^2, (is) with (di, (i2, ds) £ Ad- 
Proof: Since M is real, M has a singular value decomposition M = O1DO2 
where each Oi is orthogonal. We can rewrite M = O1DO2 where Oi = det(Oj) Oi 
is in 50(3) and D = det{0i02)D. Each Oj G 50(3) represents a unitary oper- 
ation Z//j. If D G Ad, it represents a quantum operation Dg-, and M represents 
Ui o T>g'oU2 which is a quantum operation. Conversely, if M represents a quan- 
tum operation, D = 0\m02 also represents a quantum operation. By lemma ^ 
[di,d2,dz) G Ad. 
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Note that D is unique up to negating two entries simultaneously. One can verify that A^^ 
is indeed invariant under the simultaneous negation of two entries. 

B.3 Unital processes as random unitary processes 

The following result is now immediate: 

Theorem 9: Any unital process on a qubit is a random unitary process. 
Proof: Let M be the linear representation of £. From Theorem Af = O1DO2 
where Oi £ 5*0(3) and D G A^^. Oj and D respectively represent some unitary 
operations Ui{p) = UipUj and V^. Therefore, 

£{p) = UioV^oU2{p) (B.12) 
= qo{UiU2)p{UiU2)^ + qAUiXU2)p{UiXU2)^ (B.13) 
+ qyiUiYU2)p{UiYU2)^ + q.iUiZU2)p{UiZU2)^ (B.14) 

which is a random unitary process. 

The above proof provides a constructive method to find the unitary operation elements, as 
well as a circuit for realizing the unital operation: 



P U2 



Z 



X 



Ui — £{p) 



Figure B.l: A circuit model for an arbitrary unital quantum operation. The environmental 
state is given by |e) = y/qo\00) + y/q^\Ol) + ^/gl|10) + ^/q^\ll). 



B.4 Unital processes for higher dimensional systems 

Linear representations for quantum operations on a d-dimensional system are given by real 
(d? — 1) X [d? — 1) matrices. Most of the arguments for the qubit case can be generalized. 
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For example, one can generalize the random Pauli channel to T>^{p) = X^a,. QkEkpEl where 
JlkQk = 1) -E-A; are unitary, and the states (Ef^ (gi I)J2i are orthogonal. For instance, 
Ek can be chosen to be the nice error basis in |7p]. The complete positivity of T)^ is 
still equivalent to the condition qk > V/c. One can likewise deduce the criteria for a 
diagonal real {cP — 1) x (d? — 1) matrix to be a completely positive map. A general real 
(d^ — 1) X [cP — 1) matrix still has a singular decomposition in terms of special orthogonal 
and diagonal components. However, a matrix in SO{d? — 1) may not correspond to any 
unitary operation. In fact, there are {cP — l){d? — 2)/2 free parameters in SO{(P — 1) but 
only d? — 1 free parameters in SU{d), and a 1-1 correspondence between the two sets is 
impossible. 

B.4.1 Explicit counter-examples 

We now construct unital processes that are not random unitary processes. The set of all 
doubly stochastic processes on a fixed Hilbert space is a convex set. Likewise the set of all 
unital processes on a fixed Hilbert space is also convex. A point x in a closed convex set S is 
extreme if it cannot be expressed as a non-trivial convex combination of elements in S. It 
follows that an extreme point which is not a unitary operation cannot be a random unitary 
process (which contradicts extremality). Moreover, a doubly stochastic process which is 
extreme in the set of all unital processes is extreme in the set of all doubly stochastic 
processes. The extreme points in the set of all unital processes are well characterized by 
theorem 5 in |28]. The special case for our purpose can be stated as: 

Theorem 10: The set of completely positive unital linear maps on a d-dimensional 
system has extreme points of the form £{p) = '}2k -^kP^k where {^fc^j^} is a lin- 
early independent set. 

Corollary 1 : Consider the map on a qutrit: 

£{p) = {AipA\ + A2pA\ + A^pA\) (B.15) 

^ A set S inside a vector space is convex if, Vx, y G S, ax + (1 — a)y £ S \/ a £ [0, 1]. 
In other words, x is extreme if V < a < 1 y, a G S, a; = ay + (1 — a)z ^ x — y = z. 
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where Aj are related to the Gell-Mann matrices A,: 



= Ai = 



V2 



10 

1 




^2 = A4 = 



V2 



1 



1 



A3 = Ae = 



V2 




1 
10 



It is straightforward to verify that £ is doubly stochastic and {vlfeAj} is linearly 
independent and therefore is not a random unitary process. 

For a system of dimension > 3, a doubly stochastic process which affects a three 
dimensional subspace according to £ and leaves the orthogonal complement 
invariant is likewise extreme, and cannot be a random unitary process. 



Appendix C 



Miscellaneous 



C.l Upper bounds for n 

An argument for c ~ 1 for large n is presented using Paley's construction (mentioned in 
Section 7.3.4), known results on primes in intervals and the prime number theorem for 



arithmetic progressions. 

Let 7r(x) be the number of primes p which satisfy 2 < p < x. For x > 67, x/(logx — 
1/2) < 7r(x) < x/(logx— 3/2) ||9^. It follows that there exists a prime between n and n(l+e) 
for e > 2/logn. Applying Paley's construction, H(p + 1) or H{2{p + 1)) exists depending 
on whether p = 3 mod 4 or p = 1 mod 4. Therefore, n < n(l + e) + 1 or n < 2(n(l + e) + 1) 
respectively. 

The worse of the upper bounds n < 2(n(l + e) + 1) resulting from p = 1 mod 4 can be 
improved. Note that there are at least r primes between n and n(l + e)''. If the primes 
that equal 3 mod 4 and 1 mod 4 are randomly and uniformly distributed, the probability 
to find a prime which equals 3 mod 4 between n and n(l + e)*" is larger than 1 — 2"''. This 



assumption is true due to the prime number theorem for arithmetic progressions [43|. Let 
Tr{x, a, q) denotes the number of primes p in the arithmetic progression {a, a + q,a + 2q, . . .} 
which satisfy 2 < p < x. It is known that tt{x, 3, 4) ~ tt{x, 1,4). Therefore, with probability 
larger than 1 — 2~^', n < n(l + e)^' + 1, implying c < (1 + e)*" + 1/n ~ 1 for large n. 
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C.2 Mixed state description of the two-bit code 

Recall that the initial state after the ancilla preparation is given by pq = <Si {I + o'z)/2 
(see Eq. (|8.29|) , with oja omitted). After Rya{0), the density matrix is given by 



pi = (cos 0az + sin 9ax) {I + az)/2 . 



(C.l) 



Without coding, phase damping changes the density matrix to 



control 
P5 



COS 6az + {I — 2pa) sin 9ax ^{I + az)/2 



(C.2) 



With coding, the encoding, phase damping, and decoding change the density matrix to ps, 
P4, and P5: 



coded 



coded 

Pa 



coded 
P5 



= COS 9{az + (Tj^ ax)/2 

+ sme{ax(S) I + I (S> dy) /2 (C.3) 

= cos6{az + (1 - 2pa){l - 2pb)(Jy ax)/2 

+ sme{{l -2pa)ax(S) I + {I -2pb)I^ ay) /2 (C.4) 

= coseaz0{I + {l-2pa){l-2pb)az)/2 

+ smeax'^{{l-2pa)I+{l-2pb)az)/2 (C.5) 

= COsOaz'^ {1 - Pa- Pb + 2paPb){I + (^z) + {Pa+ Pb- 2paPb){I - CTz) /2 

+ Smeaa:^\{l-Pa-Pb){I + (^z) + {-Pa+Pb){I-(^z)\/2 (C.6) 



cos 6(1 -Pa-Pb + 2paPb)<7z + sin6'(l -pa- Pb)(^x 



{I + Oz)/2 



+ cos 6{pa +Pb- 2paPb)(^z + sin 0{-pa + Pb)(Tx ® (/ - (rz)/2 



(C.7) 
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C.3 Two bit code experiment with very different T2's 



While the case of equal T2's is interesting from a theoretical standpoint, different spins in 
a molecule typically have quite different T2's. To study the two-bit code in this regime, 
we performed experiments with carbon-13 labeled chloroform dissolved in acetone 29|. 
All parameters were similar to the sodium formate sample, except for the relaxation time 
constants. 

In the chloroform experiment, Ti's were 16 s and 18.5 s and T2's were 7.5 s and 0.35 s 
for proton and carbon respectively. Separate experiments with the ancilla dephasing much 
slower or faster than the input were performed by interchanging the roles of proton and 
carbon. T|'s and t^'s were as listed in Q. The ellipticities are shown in Fig. |C.l| . 

a b 





Storage time [ms] 



Storage time [ms] 



Figure C.l: Ellipticities obtained in the chloroform experiments, with (a) proton and (b) 
carbon as the ancilla. Carbon dephases much faster than proton. Error bars represent 95% 
confidence level. 



Prom Fig. C.l a, it is apparent that coding almost removes the distortion entirely when 



a much better ancilla is available. The question is, is coding advantageous over storing 
in the good ancilla alone? Theoretically, coding is always advantageous because the error 



probability is always reduced from pi {i being the input spin) to PaPb- Fig. C.l b shows that 
experimentally, such improvement is marginal, because the advantage of coding is offset by 
the noise introduced. Therefore, when the T2's are very different, the bottle neck is the 
dephasing of the bad qubit. 
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